Overview of HarvEST
HarvEST is now available for the following organisms: Barley, Brachypodium, Citrus, Coffea, Cowpea, Soybean, Rice, Wheat. HarvEST software for Windows can be downloaded from this site (see links below). For online functionality (slower than Windows, not all functions are available online yet) many functions of HarvEST can be used by connecting to www.harvest-web.org.
HarvEST is principally EST database-viewing software that emphasizes gene function and is oriented to comparative genomics and the design of oligonucleotides, in support of activities such as microarray content design, function annotation, physical and genetic mapping. HarvEST:Barley has additional functions to support comparative genome mapping. HarvEST is developed at the University of California, Riverside by Steve Wanamaker, Timothy Close, Mikeal Roose and Matthew Lyon. The software is downloadable from this web site as installation files (see postings below) that allow HarvEST to run on any PC that uses Windows and meets the minimum hardware configuration (including Macintosh computers with a Windows emulator). Unless otherwise stated, all EST sequences in HarvEST have been quality trimmed, cleaned of vector, and assembled using CAP3 and quality values (special thanks to Xiaoqiu Huang for new versions of CAP3). Exceptions to this rule are versions of HarvEST containing assemblies received from Affymetrix, which were the basis of Affymetrix rice, soybean and wheat genome arrays. HarvEST includes an ACE file viewer that allows the user to examine the sequence alignment and readily determine where individual sequences reliably deviate from a consensus sequence. For key features of HarvEST, no Internet connection is required. Key features include: a choice of assemblies, sequence alignment viewing, archived BLAST hit information, unigene export, Boolean searching with user-defined quantitative settings, and various other searching, reporting and export functions. The unigene export feature can be applied to entire assemblies, individual libraries, or any combination of libraries. For Boolean searches, a user sets parameters according to their interest. For example, the choice could be genes associated with a particular stress (e.g. low temperature, heat, drought, aluminum, nitrogen deprivation), developmental stage (e.g. malted seed, pre-anthesis spike, mature fruit), or tissue (e.g. endosperm, pericarp, embryo sac, rachis, root tip, rind). The user also decides what percentage representation is relevant. Once the choices have been made, the search is executed locally and a browsable output is displayed. The output can be viewed on-screen or exported as summary tables or sequence files. HarvEST also contains hyperlinks to other sequence databases and facilitates connection to NCBI for live BLAST searches of the public dbEST database.
Please check this site periodically for new versions. Subsequent versions will respond to user feedback, and contain updated information and new functions, including web-operability. If you would like to be automatically notified of new versions, please send a message asking that you are placed on a notification list.
HarvEST:Cowpea
Version 1.14 of "HarvEST:Cowpea" displays 17 libraries from Vigna unguiculata. ESTs sequenced at the US Department of Energy Joint Genome Institute (141,050 ESTs), the J Craig Venter Institute (41,505 ESTs) and the University of California Riverside (488 ESTs) have been derived from chromatograms using the full HarvEST pipeline. These ESTs, from 12 libraries produced at UC Riverside and 2 at the International Institute for Tropical Agriculture, retain their quality values and can be viewed more extensively than the remaining 75 ESTs, which were downloaded from the GenBank dbEST database by Steve Wanamaker at UC Riverside. We are eager to receive cowpea EST chromatograms from all sources and would be pleased to produce assemblies that include all existing cowpea ESTs. HarvEST:Cowpea contains best BLASTX hits from the soybean genome (Phytozome Glyma1; December 2008), UniProt (April 2008), the Arabidopsis genome (TAIR version 8; April 2008) and the Medicago truncatula genome (IMGAG Mt2.0; May 2008). Initial development of HarvEST:Cowpea was funded by the USDA Plant Genome program; ongoing development is currently supported by the CGIAR Generation Challenge Program (http://www.generationcp.org/index.php).
HarvEST:Barley
Version 1.73 of "HarvEST:Barley" contains five assemblies (21, 25, 31, 32 and 35). Assemblies #21 and #25 are very closely related to the "Barley1" microarray produced by Affymetrix as part of the USDA-IFAFS project, "An Integrated Physical and Expression Map of Barley for Triticeae Improvement". Assembly #31 was used for overgo probe design to screen a Morex BAC library for gene-bearing clones in the NSF Plant Genome Research project, "Coupling EST Sequences and BAC Resources to Access the Barley Genome". Assemblies #32 and #35 were the main sources of SNPs for Illumina oligonucleotide pool assays developed initially for the same NSF Plant Genome Research project and subsequently the USDA-CSREES Barley Coordinated Agricultural Project (BarleyCAP; www.barleycap.org). Version 1.73 includes a barley genetic map and rice synteny viewer (Windows version only, not currently available through www.harvest-web.org) including 2943 barley SNP loci; the map viewer is functional when assembly #32 or #35 is active. Version 1.73 (559 MB download, 890 installed) also is enabled with special features for viewing the Affymetrix "Barley1" microarray content, including probe set location, probe sequences, and enhanced probe set annotations. Version 1.73 contains barley EST data sets of more than 30,000 each from USDA-funded projects in the US (Rod Wing et al.), IPK Gatersleben (Andreas Graner et al.), Okayama University and the National Institute of Genetics (Kaz Sato et al.), Scottish Crop Research Institute (Robbie Waugh et al.), and University of Helsinki (Alan Schulman et al.), as well as smaller (less than 3,000 each) datasets of barley ESTs, whole cDNAs and genomics sequences from several other contributors. HarvEST:Barley contains best BLASTX hits from UniProt (August 2008), the annotated rice (TIGR version 6; January 2009) and Arabidopsis (TAIR version 8; April 2008) genomes. Development of HarvEST:Barley was initially funded by the USDA Plant Genome program and the NSF Plant Genome Research Program, and currently is funded by the University of California Agricultural Experiment Station.
Click here to see more about the relationship of HarvEST:Barley and the Affymetrix Barley1 GeneChip including a powerful probe set annotation tool available in HarvEST:Barley.
HarvEST:Citrus
Version 1.25 of "HarvEST:Citrus" displays 89 libraries and 229,570 ESTs from Citrus and Poncirus. ESTs from 16 libraries produced at University of California Riverside (Timothy Close & Mikeal Roose), ten at University of California Davis (Abhaya Dandekar), five at USDA/ARS US Horticultural Research Lab in Ft. Pierce, Florida (Robert Shatters, Michael Bausher, Jose Chaparro, Greg McCollum) and two at Volcani Center, Israel (Avi Sadka) have been derived from chromatograms using the full HarvEST pipeline. These ~140,000 ESTs retain their phred quality values and therefore can be viewed more extensively than other sequences in HarvEST:Citrus. All other sequences were downloaded as flat files from GenBank nr or dbEST. In general, the dbEST flat file sequences have been truncated to attempt to include only high quality sequence regions in HarvEST:Citrus. Sequences derived from GenBank flat files are from Universidad Politecnica de Valencia, Valencia, Spain (~71,000 ESTs); USDA/ARS US Horticultural Research Lab in Ft. Pierce (~11,000 ESTs); East Tennessee State University (~3,000 ESTs); Laboratory of Biotechnology & Citrus Genome Analysis Team, Shizuoka, Japan (~2,500 ESTs); smaller numbers of ESTs from several other contributors; and cDNA or genomic sequences from the GenBank nr database, including 40 microbial pathogen sequences. HarvEST:Citrus contains best BLASTX hits from UniProt (August 2008), the Arabidopsis genome (TAIR version 8; April 2008) and the poplar genome (JGI version Poptr1.1). Initial development of HarvEST:Citrus was supported by funding from the USDACSREES Plant Genome program; further development was supported by the California Citrus Research Board and the University of California Discovery Grant Program.
HarvEST:RiceChip
Version 1.10 of "HarvEST:RiceChip" utilizes the same probe set display and annotation functions as HarvEST:Barley, but includes only the assembly that was produced by Affymetrix for the rice GeneChip®. HarvEST:RiceChip contains best BLASTX hits from UniProt (August 2008), BLASTN from rice (TIGR version 6; January 2009), and BLASTX from Arabidopsis (TAIR version 8; April 2008) genomes. Development of HarvEST:Rice was funded by the USDA Plant Genome program.
HarvEST:SoyChip
Version 1.05 of "HarvEST:SoyChip" utilizes the same probe set display and annotation functions as HarvEST:Barley, but includes only the assembly that was produced by Affymetrix for the soybean GeneChip® content design. HarvEST:SoyChip contains best BLASTX hits from UniProt (August 2008), the soybean genome (Phytozome Glyma1; December 2008), the Arabidopsis genome (TAIR version 8; April 2008) and the Medicago truncatula genome (IMGAG MT2; May 2008). Initial development of HarvEST:SoyChip was supported by the USDA Plant Genome program; updates are supported by the University of California Agricultural Experiment Station.
HarvEST:WheatChip
Version 1.55 of "HarvEST:WheatChip" utilizes the same probe set display and annotation functions as HarvEST:Barley, but includes only the assembly that was produced by Affymetrix for the wheat GeneChip® content design. HarvEST:WheatChip contains best BLASTX hits from UniProt (August 2008), the rice genome (TIGR version 6; January 2009) and the Arabidopsis genome (TAIR version 8; April 2008). HarvEST:Wheat is a somewhat related software. Version 1.16 of "HarvEST:Wheat" displaying about 101,000 wheat and other Triticeae ESTs produced mainly by a NSF-sponsored wheat project. Version 1.16 of HarvEST:Wheat contains four different assemblies. Initial development of HarvEST:WheatChip and HarvEST:Wheat was funded by the USDA Plant Genome program; updates are supported by the University of California Agricultural Experiment Station.
HarvEST:Brachypodium
Version 0.52 of "HarvEST:Brachypodium" displays 6 libraries from Brachypodium distachyon. All sequences were downloaded from the GenBank dbEST database by Steve Wanamaker at UC Riverside. HarvEST:Brachypodium contains best BLASTX hits from UniProt (January 2007) and the rice (TIGR version 5; February 2007) and Arabidopsis (TAIR version 7; April 2007) genomes. Initial development of HarvEST:Brachypodium was funded by the USDA Plant Genome program.
HarvEST:Coffea
Version 0.13 of "HarvEST:Coffea" displays 12 libraries from Coffea arabica, Coffea canephora or an interspecies hybrid. All sequences with quality values were received by Steve Wanamaker at UC Riverside from the Tanksley lab at Cornell University or downloaded as flat files from the GenBank dbEST database. HarvEST:Coffea contains best BLASTX hits from UniProt (January 2007) and the Arabidopsis genome (TAIR version 7; April 2007). Initial development of HarvEST:Coffea was funded by the USDA Plant Genome program.
We can apply the copyrighted HarvEST software to EST assemblies from any organism. Click here to see the file requirements if you have an interest in working with us to do this.
|
|
Larger assemblies |
|
|
Global SNP finder |
|
|
Links to BAC clones and contigs |
|
|
Links to OligoSpawn overgo probe designer and database |
|
|
Sequence alignment viewer - to navigate within the CAP3 sequence alignments and view SNP's |
|
|
Two assemblies |
|
|
183,118 cowpea ESTs |
|
|
Best BLASTX of UniProt (April 2008), Arabidopsis (TAIR version 8; April 2008) and Medicago truncatula (IMGAG Mt2.0; May 2008) genomes, with hyperlinks |
|
|
159 MB download, 241 MB installed |
|
|
Search by expression pattern |
|
|
Extensive "Output Unigene" and other export functions |
|
|
Barley genetic map viewer (active from assemblies #32 and #35) including 2943 mapped SNP loci, including rice synteny view with zoom-in (Windows only; not available through www.harvest-web.org) |
|
|
Batch export of genetic map coordinates, marker names, mapped unigene sequences with annotations |
|
|
Support of the Affymetrix "Barley1" chip: export of probe set annotations, graphical displays of probes on unigenes, other "Search the Barley Chip" functions |
|
|
Sequence alignment viewer - to navigate within the CAP3 sequence alignments and find SNP's |
|
|
444,652 barley ESTs and about 1100 other barley sequences |
|
|
Best BLASTX of UniProt (January 2007), rice (TIGR version 5; February 2007) and Arabidopsis (TAIR version 8; April 2008) genomes, with hyperlinks |
|
|
Cross-references unigenes between different assemblies |
|
|
559 MB download, 890 MB installed |
|
|
Search by expression pattern |
|
|
Extensive "Output Unigene" and other export functions |
|
|
Sequence alignment viewer - to navigate within the CAP3 sequence alignments and find SNPs |
|
|
229,570 sequences, including quality values for ~140,000 ESTs |
|
|
Best BLASTX of UniProt (August 2008), Arabidopsis (TAIR 8; April 2008) and poplar (JGI version Poptr1.1; Septembr 2006) with hyperlinks |
|
|
Arabidopsis and poplar map displays |
|
|
Contains the two "all citrus" assemblies (C37 & C38) used for the Affymetrix Citrus GeneChip® |
|
|
Support of the Affymetrix Citrus GeneChip®: exports up-to-date probe set annotations, graphical displays of probes on unigenes, other "Search the Citrus Chip" functions |
|
|
Displays Affymetrix Citrus GeneChip® probe positions and other probe details |
|
|
Seven assemblies in total: five combining all citrus species, one with only Poncirus trifoliata (37,481), one with only Citrus sinensis (98,176) |
|
|
Cross-references unigenes between different assemblies |
|
|
349 MB download, 620 MB installed |
|
|
Search by expression pattern |
|
|
Extensive "Output Unigene" and other export functions |
|
|
Support of the Affymetrix Wheat chip: exports up-to-date probe set annotations, graphical displays of probe sets on unigenes, other "Search the Wheat Chip" functions |
|
|
Sequence alignment viewer - to navigate within the CAP3 sequence alignments |
|
|
Contains the assembly created and used by Affymetrix to design the wheat chip |
|
|
Best BLASTX of UniProt (August 2008), rice (TIGR version 6; January 2009) and Arabidopsis (TAIR version 8; April 2008) genomes, with hyperlinks |
|
|
163 MB download, 420 MB installed |
|
|
"Output Unigene" and other export functions |
|
|
Support of the Affymetrix Wheat chip: exports up-to-date probe set annotations, graphical displays of probe sets on unigenes, other "Search the Wheat Chip" functions |
|
|
Sequence alignment viewer - to navigate within the CAP3 sequence alignments |
|
|
Four assemblies: two containing 101,107 Triticum, Aegilops and Secale ESTs; two containing 91,130 Triticum ESTs |
|
|
Best BLASTX of UniProt (August 2008), rice (TIGR version 6; January 2009) and Arabidopsis (TAIR version 8; April 2008) genomes, with hyperlinks |
|
|
189 MB download, 351 MB installed |
|
|
Cross-references unigenes between different assemblies |
|
|
Search by expression pattern |
|
|
Extensive "Output Unigene" and other export functions |
Click Here to View HarvEST Screen Shots
Select libraries in which you wish to see EST’s by entering, in the Min % field above the "Include" column, a minimum threshold percentage (example 0.2%), and putting a check mark next to each desired library.
Select the libraries from which you wish to exclude EST’s by entering, in the Max % field above the "exclude column", a maximum threshold percentage (example 0.04%), and putting a check mark next to each of those libraries.
You may find ESTs in the database by searching by function using keywords.
You may find ESTs by Genbank #, EST name, or HarvEST unigene #. Note: HarvEST unigene numbers change between versions and differ between assemblies.
You may input single probeset names or browse to a list of probe set names. You may output annotations or view the details of the unigenes. To generate annotations you may decide how many probes in a probe set must match an annotated unigene to absorb the annotation. The highest blastx score from any unigene touched by the probe set is reported.
|
|
The unigene(s) selected through one of the above searches are shown on the left side. The right side has alternative views, controlled by selections on the upper right. The "Distribution Among Libraries" view displays where each member of the unigene was found within the libraries. For each library, the number of unigene members, percentage of library, and a bar graph of the percentage of library are shown. The "Alignment" view shows the position and orientation of each EST within each CAP3 contig. The "Sequence Alignment" view shows the full sequence information, including colorized representation of phred quality values and positions of deviation from the consensus sequence. | |
|
|
The best BLASTX hit against the translated NCBI nr database for the highest-scoring sequence in the selected contig is shown at the bottom left, and against the TIGR annotated rice (IRGSP) and/or Arabidopsis genome in the adjacent window(s). | |
|
|
Scrolling
through the list of unigenes causes the display of libraries on the right to
change to reflect the distribution of the each unigene
that is highlighted. | |
|
|
Click on a library name to view a detailed description sheet for a library. | |
|
|
Click the View Members of Selected Unigene button for a browse-able list of the ESTs in the highlighted unigene. | |
|
|
Click the
Output all Sequences from Selected Unigene button to export the
highlighted unigene to a FASTA-formatted text file. | |
|
|
Click the
View/Blast Consensus of Selected Unigene button to perform an NCBI
BLASTX-nr search. (This requires an Internet connection.) | |
|
|
Click the Output the Above Unigene List button to create a FASTA or tab-delimited text file. |
|
|
|
Click the Create a Unigene/Assembly Cross-Reference button to generate a tab-delimited text file. |
|
|
HarvEST contains multiple assemblies. Select this option to switch between them. All other displays are in the context of the selected assembly. |
|
|
The "Library Summary" shows the libraries are in the assembly and how many clones, contigs, contigs unique to the library, and singletons for each library. The "Orientation Calls by Library Report" shows the number of forward and reverse orientation reads from each library. Orientations are determined by a combination of sequencing primer information, presence of polyT or polyA, and best BLASTX orientation. The "Print a Summary by Source" lists the number of ESTs from each person, lab, or group. These numbers are generally a little higher than reported elsewhere in HarvEST since some types of hidden ESTs that are in an intermediate processing stage may be counted. |
The assemblies in HarvEST:Citrus, HarvEST:Wheat and HarvEST:Barley were made by Steve Wanamaker using CAP3. The sequence clusters in these assemblies are not identical to clusters created by other programs. The unigene numbers are different for each assembly and do not correspond to unigene numbers in any other assembly or database. The assemblies in the HarvEST:Affymetrix versions for Wheat, Rice and Soybean were produced by Affymetrix and have been included in HarvEST software with permission of Affymetrix. HarvEST:Barley assembly #21 was the source of barley content for the Affymetrix barley genome array. HarvEST:Citrus assemblies #37 and #38 were the sources of citrus content for the Affymetrix citrus genome array.
|
|
Windows 95/98/Me, NT4/2000/XP/Vista (Windows is REQUIRED, for online function go to www.harvest-web.org) |
|
|
1 GHz or higher processor recommended |
|
|
1 GB free hard disk space is REQUIRED for HarvEST:Barley |
|
|
512
MB RAM minimum recommended |
|
|
1024 x 768
video resolution is REQUIRED |
|
|
Internet connection (to use hyperlinks to other databases from HarvEST - optional, but nice) |
Click Here To Download HarvEST:Barley version 1.73 (6/11/2009) 559 MB download, 890 MB installed. This contains five different assemblies. See License Agreement (below) and System Requirements (above).
Click Here To Download HarvEST:Brachypodium version 0.52 (5/1/2007) 21 MB download, 30 MB installed. This contains two assemblies. See License Agreement (below) and System Requirements (above).
Click Here To Download HarvEST:Citrus version 1.25 (6/11/2009) 349 MB download, 620 MB installed. This contains six different assemblies. See License Agreement (below) and System Requirements (above).
Click Here To Download HarvEST:Coffee version 0.13 (11/2/2007) 55 MB download, 75 MB installed. This contains two assemblies. See License Agreement (below) and System Requirements (above).
Click Here To Download HarvEST:Cowpea version 1.14 (6/11/2009) 159 MB download, 241 MB installed. This contains two assemblies. See License Agreement (below) and System Requirements (above).
Click Here To Download HarvEST:RiceChip version 1.10 (6/11/2009) 149 MB download, 346 MB installed. This contains one assembly. See License Agreement (below) and System Requirements (above).
Click Here To Download HarvEST:SoyChip version 1.05 (6/11/2009) 129 MB download, 366 MB installed. This contains one assembly. See License Agreement (below) and System Requirements (above).
Click Here To Download HarvEST:Wheat version 1.16 (6/11/2009) 189 MB download, 351 MB installed. This contains four different assemblies. See License Agreement (below) and System Requirements (above).
Click Here To Download HarvEST:WheatChip version 1.55 (6/11/2009) 163 MB download, 420 MB installed. This contains one assembly. See License Agreement (below) and System Requirements (above).
Download the H*.exe file to a local hard drive. Double click the H*.exe file to launch the Windows installer. The installation directory defaults to a sub-directory off of C:\HarvEST. You may select an alternate location if you wish. The different HarvEST programs can be installed without conflicting with each other. The Visual FoxPro runtime libraries will be installed and the executable and data files will be de-compressed to the installation directory.
Un-install the previous version through Windows (see below), then install the new version.
Go to Start, Settings, Control Panel, Add/Remove programs, Click on HarvEST, click Remove. Any residual directory, contents or desktop icon can then be manually deleted. A separate uninstallation through Windows must be performed for each HarvEST software.
To
receive and use this program you must agree to the following conditions:
1. This software is available free of charge for academic purposes. Contact us for a price quote if you do not make your data publicly available.
2. You may re-distribute HarvEST software as long as:
a. You do not charge for it.
b. You do not alter it in any
way.
c. You do not include it as part of software that you charge for.
3. You agree to give us feedback concerning program performance, ease of use, the utility of various features, and any errors found.
Contacts for questions, bug reports, suggestions:
Steve
Wanamaker, steve.wanamaker@ucr.edu
