Annual Progress Report
National Research Support Project- NRSP-8

Year Ending 2002

Craig W. Beattie, Anette Rink

Radiation Hybrid Mapping project - IMpRH7000rad:

A total of 2,035 ESTs from immune tissues and placentae were used to construct an RH map using the IMpRH7000Rad panel (Yerle et al., 1998; Hawken et al., 1999) and the MAXLIK option of RHMAP V3 (Boehnke et al., 1996) at a 2pt LOD of 6 or 8. The number of observed/expected markers per chromosome ranged from 0.53 for SSC11 to 2.02 for SSC12, respectively. Average marker retention frequency was 30.3%. Annotated ESTs represent 47% of the markers, 30% ESTs showed homology to ESTs from a variety of species, 20% represent novel sequences (ns) with no significant match in any nucleotide or protein database, 3% of the ESTs only identify open reading frames (ORFs) in the human genomic sequence (HGS). More than 28% of the ESTs assigned to the physical map of the pig did not find a homologous sequence in V30 of the human genome assembly. This is indicative of assembly gaps in the HGS.

Figure 2. Genome 'Deserts' and Gene Rich Regions in the Porcine Genome. Distribution of 2,035 mapped ESTs. The columns on the left (pink) represent the percentile of DNA in each individual chromosome (Schmitz et al., 1992). Blue columns represent the percentiles of markers mapped. SSC 1, 8, 11 and X (indicated by arrows) correspond to HSA 18, 4, 13 and X, respectively and represent 'gene deserts' both in the human and the porcine genomes. A total of six porcine chromosomes (SSC2, SSC5, SSC6, SSC7, SSC12 and SSC14) are syntenic with the three gene-richest human chromosomes, HSA17, 19 and 22.

The Second Generation EST Radiation Hybrid (RH7000rad) Map (2035 ESTs).

The, as yet unpublished, Second Generation EST Radiation Hybrid Map confirms 98% coverage of the human genome. Currently ~ 60 breaks between chromosomes and ~70 breaks within regions of homology have been detected. A final number cannot be determined at this time since the orientation of a significant number of contigs in HGS Build 29 remains to be determined. Additionally, 4.9% of markers in the current map identify other homologous human chromosomes in HGS Build 29 vs 5.4% in the 1st Generation EST Radiation Hybrid Map. The current porcine RH map extends beyond the genetic linkage map on 38 telomeres. 28.9% of all markers do not find orthologous sequences in the HGS Build 29 vs 23.6% in HGS Build 21, indicating an increasing number of deletions, particularly of telomeric sequences, in subsequent builds of the human genome sequence. The human genome sequence assembly currently does not extend beyond the human RH map(s).

SSC123456789101112131415161718X
Total number of EST markers mapped1601811381241421651576911451371121401289848514872
Genes55743854635481274617114062523423201237
ESTs70777246208246354923185354534316241820
novel sequences3530282459293071911819242321971815
Number of linkage groups per chromosome157853856636112465127
Number of syntenic chromosomes5332243133112421111
ESTs that find matches outside syntenic chromosomes where synteny is based on chromosomal paints692351657533105811803
Markers that do not find matches in the HGS V294759562531594518311512294738191591716
Genes2125576122111257743222
ESTs292428814291312107514231456567
novel sequences1623231210242041075101717106297
Table 5. The 2pt LOD score criterion for creating the 2nd generation porcine EST radiation hybrid map using RHMAP V3 (Boehnke et al., 1996) was set at 6. Good statistical support for local gene order was considered to be more valuable than creating large linkage groups and obtaining complete chromosome coverage through a minimum number of linkage groups. This is particularly important in those regions where the human genome sequence is not fully assembled or ambiguous. A total of 588 ESTs (~29%) do not identify sequences with significant homology in HGS Build 29. An additionally 4.9% ESTs (100 markers) identify significant matches on human chromosome fragments that do not correspond to the respective porcine chromosome based on bi-directional painting results (Goureau et al., 1996; Pinton et al., 2000).

We are currently using Carthagene (http://www.inra.fr/bia/T/schiex/Export/WABI.pdf) to calculate an updated version of the 2nd Generation pig human comparative map (IMpRH7000Rad panel). Markers that introduced new homologies to the comparative map are reevaluated based on their multipoint linkage assignment (using the IMpRH mapping tool), rather than on their 2pt LOD value. These maps will be transferred to the FTP site as they become available. An Excel file (Complete Info for 2035 ESTs.xls) containing all pertinent information (annotation, primers, location in HGS build 29, mapping information) is also available from the FTP site. Build 30 of the HGS was released June 24th, 2002 and covers > 70% of the human genome in currently 1,736 contigs assembled, covering 2.87 billion base pairs. BLASTN comparisons against the contig assembly and dbEST have been completed. BLASTP comparisons of the dataset against the CCBC protein database set and nr_NCBI are in progress. Additionally plates MOA131-151 from additional sequencing efforts of the apicomplexin parasite Toxoplasma gondii and its effects on swine epithelial cells, in our laboratory, will be compared against the 1st draft of the Plasmodium falciparum genome as soon as the dataset is available for downloading. An update of both datasets, parasite and (swine) host response genes, is currently under way. We expect to map a significant number of these host response ESTs on the IMpRH12000rad panel.

Radiation Hybrid Mapping project - IMpRH12000rad: We have assigned 102, 138, 124 and 59 ESTs to SSC12, 7, 13 and X, respectively to the IMNpRH212,000-rad panel. These have been finalized and will be available on our web sites (www.toulouse.inra.fr/lgc/lgc/, http://www.ag.unr.edu/ab/standard.htm), shortly. In addition, we have mapped and are finalizing 910 ESTs on SSC1-6 and 68 on SSC8 (current total ~ 1400) that should also be available online shortly, at the above web sites.