Prediction of the coding sequences of unidentified human genes. XXII. The complete sequences of 50 new cDNA clones which code for large proteins.
Article Details
- CitationCopy to clipboard
Nagase T, Kikuno R, Ohara O
Prediction of the coding sequences of unidentified human genes. XXII. The complete sequences of 50 new cDNA clones which code for large proteins.
DNA Res. 2001 Dec 31;8(6):319-27.
- PubMed ID
- 11853319 [ View in PubMed]
- Abstract
As an extension of human cDNA projects for accumulating sequence information on the coding sequences of unidentified genes, we herein present the entire sequences of 50 cDNA clones, named KIAA1939-KIAA1988. cDNA clones to be entirely sequenced were selected by two approaches based on their protein-coding potentialities prior to sequencing: 10 cDNA clones were chosen because their encoding proteins had a molecular mass larger than 50 kDa in an in vitro transcription/translation system; the remaining 40 cDNA clones were selected because their putative proteins-as determined by analysis of the genomic sequences flanked by both the terminal sequences of cDNAs using the GENSCAN gene prediction program-were larger than 400 amino acid residues. According to the sequence data, the average sizes of the inserts and corresponding open reading frames of cDNA clones analyzed here were 4.6 kb and 1.9 kb (643 amino acid residues), respectively. From the results of homology and motif searches against the public databases, the functional categories of the 31 predicted gene products could be assigned; 25 of these predicted gene products (81%) were classified into proteins relating to cell signaling/communication, nucleic acid management, and cell structure/motility. The expression profiles of the genes were also studied in 10 human tissues, 8 brain regions, spinal cord, fetal brain and fetal liver by reverse transcription-coupled polymerase chain reaction, the products of which were quantified by enzyme-linked immunosorbent assay.