
Abstract Background The ESTree db http://www.itb.cnr.it/estree/ represents a collection of Prunus persica expressed sequenced tags (ESTs) and is intended as a resource for peach functional genomics. A total of 6,155 successful EST sequences were obtained from four in-house prepared cDNA libraries from Prunus persica mesocarps at different developmental stages. Another 12,475 peach EST sequences were downloaded from public databases and added to the ESTree db. An automated pipeline was prepared to process EST sequences using public software integrated by in-house developed Perl scripts and data were collected in a MySQL database. A php-based web interface was developed to query the database. Results The ESTree db version as of April 2005 encompasses 18,630 sequences representing eight libraries. Contig assembly was performed with CAP3. Putative single nucleotide polymorphism (SNP) detection was performed with the AutoSNP program and a search engine was implemented to retrieve results. All the sequences and all the contig consensus sequences were annotated both with blastx against the GenBank nr db and with GOblet against the viridiplantae section of the Gene Ontology db. Links to NiceZyme (Expasy) and to the KEGG metabolic pathways were provided. A local BLAST utility is available. A text search utility allows querying and browsing the database. Statistics were provided on Gene Ontology occurrences to assign sequences to Gene Ontology categories. Conclusion The resulting database is a comprehensive resource of data and links related to peach EST sequences. The Sequence Report and Contig Report pages work as the web interface core structures, giving quick access to data related to each sequence/contig.
DNA, Complementary, QH301-705.5, Bioinformatics, Computer applications to medicine. Medical informatics, R858-859.7, Genes, Plant, Biochemistry, Polymorphism, Single Nucleotide, User-Computer Interface, Gene Expression Regulation, Plant, Databases, Genetic, EST, Biology (General), Molecular Biology, database, Gene Library, Expressed Sequence Tags, Internet, Genome, Ontology, Chromosome Mapping, Computational Biology, Genomics, Sequence Analysis, DNA, Computer Science Applications, Database Management Systems, Programming Languages, Prunus, Sequence Alignment, Genome, Plant, Software, computational biology; databases, genetic; pattern recognition, automated; periodicals as topic; terminology as topic, Research Article
DNA, Complementary, QH301-705.5, Bioinformatics, Computer applications to medicine. Medical informatics, R858-859.7, Genes, Plant, Biochemistry, Polymorphism, Single Nucleotide, User-Computer Interface, Gene Expression Regulation, Plant, Databases, Genetic, EST, Biology (General), Molecular Biology, database, Gene Library, Expressed Sequence Tags, Internet, Genome, Ontology, Chromosome Mapping, Computational Biology, Genomics, Sequence Analysis, DNA, Computer Science Applications, Database Management Systems, Programming Languages, Prunus, Sequence Alignment, Genome, Plant, Software, computational biology; databases, genetic; pattern recognition, automated; periodicals as topic; terminology as topic, Research Article
| citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 48 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
