Yeast - Reference Chromosome II


EMBO J 13: 5795-5809 (1994) [95112788]

Complete DNA sequence of yeast chromosome II.

Feldmann H., Aigle M., Aljinovic G., Andre B., Baclet M.C., Barthe C., Baur A., Becam A.M., Biteau N., Boles E., Brandt T., Brendel M., Brueckner M., Bussereau F., Christiansen C., Contreras R., Crouzet M., Cziepluch C., Demolis N., Delaveau T., Doignon F., Domdey H., Duesterhus S., Dubois E., Dujon B., El Bakkoury M., Entian K.D., Feuermann M., Fiers W., Fobo G.M., Fritz C., Gassenhuber H., Glansdorff N., Goffeau A., Grivell L.A., de Hann M., Hein C., Herbert C.J., Hollenberg C.P., Holmstrom K., Jacq C., Jacquet M., Jauniaux J.C., Jonniaux J.L., Kallesoe T., Kiesau P., Kirchrath L., Koetter P., Korol S., Liebl S., Logghe M., Lohan A.J.E., Louis E.J., Li Z.Y., Maat M.J., Mallet L., Mannhaupt G., Messenguy F., Miosga T., Molemans F., Mueller S., Nasr F., Obermaier B., Perea J., Pierard A., Piravandi E., Pohl F.M., Pohl T.M., Potier S., Proft M., Purnelle B., Ramezani Rad M., Rieger M., Rose M., Schaaff-Gerstenschlaeger I., Scherens B., Schwarzlose C., Skala J., Slonimski P.P., Smits P.H.M., Souciet J.L., Steensma H.Y., Stucka R., Urrestarazu A., Van der Aart Q.J.M., van Dyck L., Vassarotti A., Vetter I., Vierendeels F., Vissers S., Wagner G., de Wergifosse P., Wolfe K.H., Zagulski M., Zimmermann F.K., Mewes H.W., Kleine K.

Institut fur Physiologische Chemie, Physikalische Biochemie und Zellbiologie, Universitat Munchen, Germany.

In the framework of the EU genome-sequencing programmes, the complete DNA sequence of the yeast Saccharomyces cerevisiae chromosome II (807 188 bp) has been determined. At present, this is the largest eukaryotic chromosome entirely sequenced. A total of 410 open reading frames (ORFs) were identified, covering 72% of the sequence. Similarity searches revealed that 124 ORFs (30%) correspond to genes of known function, 51 ORFs (12.5%) appear to be homologues of genes whose functions are known, 52 others (12.5%) have homologues the functions of which are not well defined and another 33 of the novel putative genes (8%) exhibit a degree of similarity which is insufficient to confidently assign function. Of the genes on chromosome II, 37-45% are thus of unpredicted function. Among the novel putative genes, we found several that are related to genes that perform differentiated functions in multicellular organisms or are involved in malignancy. In addition to a compact arrangement of potential protein coding sequences, the analysis of this chromosome confirmed general chromosome patterns but also revealed particular novel features of chromosomal organization. Alternating regional variations in average base composition correlate with variations in local gene density along chromosome II, as observed in chromosomes XI and III. We propose that functional ARS elements are preferably located in the AT-rich regions that have a spacing of approximately 110 kb. Similarly, the 13 tRNA genes and the three Ty elements of chromosome II are found in AT-rich regions. In chromosome II, the distribution of coding sequences between the two strands is biased, with a ratio of 1.3:1. An interesting aspect regarding the evolution of the eukaryotic genome is the finding that chromosome II has a high degree of internal genetic redundancy, amounting to 16% of the coding capacity.
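
The ORF percentages quoted in the abstract can be re-derived from the counts it gives. The short Python sketch below is only an illustration and is not part of the original record. It assumes that the lower bound of the "37-45%" range counts ORFs with no detectable similarity, and that the upper bound also counts the 33 ORFs with weak similarity. That reading is an assumption, although it reproduces the published figures. The variable and function names are hypothetical.

```python
# ORF counts as quoted in the abstract.
TOTAL_ORFS = 410
categories = {
    "known function": 124,
    "homologue of gene with known function": 51,
    "homologue of gene with ill-defined function": 52,
    "similarity too weak to assign function": 33,
}


def percent(count, total=TOTAL_ORFS):
    """Return count as a percentage of total."""
    return 100.0 * count / total


# Share of each similarity class, to one decimal place.
shares = {name: round(percent(n), 1) for name, n in categories.items()}

# ORFs not covered by any of the four classes (assumed: no similarity found).
no_similarity = TOTAL_ORFS - sum(categories.values())

# Assumed reading of the "37-45%" range of genes with unpredicted function:
# lower bound = no similarity; upper bound = no similarity + weak similarity.
lower = percent(no_similarity)
upper = percent(no_similarity + categories["similarity too weak to assign function"])

for name, share in shares.items():
    print(f"{name}: {share}%")
print(f"unpredicted function: {lower:.1f}% to {upper:.1f}%")
```

Under these assumptions the bounds come out at about 36.6% and 44.6%, which round to the 37-45% stated in the abstract. The per-class shares (30.2%, 12.4%, 12.7% and 8.0%) likewise agree with the rounded values in the text.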

Note: The current version of chromosome II has been updated at several positions. However, the main change on chromosome II since its release in 1994 has been the addition of the sequence of the left telomere. See the WWW pages at for more information and details of the sequence changes.

EMBL accession numbers: Y08934, Z35762-Z36171