WHITEHEAD INSTITUTE/MIT CENTER FOR GENOME RESEARCH
HUMAN GENOMIC MAPPING PROJECT
DATA RELEASE 7 (MAY 1995)
The seventh release of data from the Human Physical Mapping
Project at the Whitehead Institute/MIT Genome Center, covering data
generated through the end of May, 1995, is now available.
This data release contains YAC screening data for 10,028 sequence tagged
sites (STSs) screened on the CEPH mega-YAC library. For each STS, we
report addresses for the YACs found to contain the STS. From the data
obtained so far, there are over 1100 contigs assembled using double
linkage between STSs.
The data is available electronically in two ways.
ANONYMOUS FTP: The entire data release is available as a set of
Microsoft Excel files and tab-delimited ascii files on our ftp
server. Using an ftp client (such as "Fetch" on the Macintosh),
Use "anonymous" as your user name, and give your e-mail address as your
password. The data files are present in the directory
The contents are as follows:
05-95.INTRO.mswd.hqx - Description of the data release, Macintosh format.
05-95.INTRO.txt - The same as ascii text
05-95.STS2YAC.sea.hqx - STS & YAC screening data, in MS-Excel format.
05-95.STS2YAC.txt - The same as tab-delimited text.
05-95.YAC2STS.txt - Inverse map of YAC to STS screening data
05-95.CONTIG2STS.txt - Doubly-linked contig information
05-95.sequence.txt - Full sequences of previously unpublished STSs.
THE WORLD-WIDE WEB: You will need a World Wide Web client such as
Mosaic (Unix, MS-Windows and Macintosh) or MacWeb (Macintosh).
Instruct your client to connect to
>From there, follow the "Human Physical Mapping Project" link. You will be
able to browse and download the raw data set as well as to view
A subset of the STSs (those for which we have chromosomal assignments)
are also available through the Genome Database (GDB). All STSs are
also submitted to GenBank.
QUESTIONS AND PROBLEMS. If users have any questions or problems,
please contact us at human_STS_help at genome.wi.mit.edu We invite
suggestions about how to make these data release most useful.
DATA RELEASE POLICY AND CITATION. Data releases are scheduled monthly.
At the end of each month, all genomic mapping
data are reviewed and prepared for distribution via CGR's electronic
databases. Data releases typically occur within a week of the
close of the month. Releases are announced by electronic messages
posted to the following two newsgroups: "bionet.genome.chromosomes" and
CGR's data release policy aims to ensure that scientific colleagues
have immediate access to information that may assist them in the
search for genes. Data releases do not constitute scientific
publication of CGR's work, but rather provide scientists with a
regular look into our lab notebooks. For projects aimed at the
analysis of particular genes or subchromosomal regions, permission is
hereby granted to use our data without the need for a formal
collaboration, subject only to appropriate acknowledgment. For
projects aimed at large-scale mapping of entire chromosomes or entire
genomes, use of the data and markers should be on a collaborative
The information for the human genome mapping project should be cited
as: Whitehead Institute/MIT Center for Genome Research, Human Physical
Mapping Project, Data Release 7 (May 1995).
Lincoln D. Stein Whitehead Institute/MIT Genome Center
lstein at genome.wi.mit.edu Cambridge, MA 02142