HarvEST

                 

 

 

 

 

Overview of HarvEST

HarvEST is now available for the following organisms: Barley, Brachypodium, Citrus, Coffea, Cowpea, Soybean, Rice, Wheat.  HarvEST software for Windows can be downloaded from this site (see links below). For online functionality (slower than Windows, not all functions are available online yet) many functions of HarvEST can be used by connecting to www.harvest-web.org.

HarvEST is principally EST database-viewing software that emphasizes gene function and is oriented to comparative genomics and the design of oligonucleotides, in support of activities such as microarray content design, function annotation, physical and genetic mapping. HarvEST:Barley has additional functions to support comparative genome mapping. HarvEST is developed at the University of California, Riverside by Steve Wanamaker, Timothy Close, Mikeal Roose and Matthew Lyon. The software is downloadable from this web site as installation files (see postings below) that allow HarvEST to run on any PC that uses Windows and meets the minimum hardware configuration (including Macintosh computers with a Windows emulator). Unless otherwise stated, all EST sequences in HarvEST have been quality trimmed, cleaned of vector, and assembled using CAP3 and quality values (special thanks to Xiaoqiu Huang for new versions of CAP3). Exceptions to this rule are versions of HarvEST containing assemblies received from Affymetrix, which were the basis of Affymetrix rice, soybean and wheat genome arrays. HarvEST includes an ACE file viewer that allows the user to examine the sequence alignment and readily determine where individual sequences reliably deviate from a consensus sequence. For key features of HarvEST, no Internet connection is required. Key features include: a choice of assemblies, sequence alignment viewing, archived BLAST hit information, unigene export, Boolean searching with user-defined quantitative settings, and various other searching, reporting and export functions. The unigene export feature can be applied to entire assemblies, individual libraries, or any combination of libraries. For Boolean searches, a user sets parameters according to their interest. For example, the choice could be genes associated with a particular stress (e.g. low temperature, heat, drought, aluminum, nitrogen deprivation), developmental stage (e.g. malted seed, pre-anthesis spike, mature fruit), or tissue (e.g. endosperm, pericarp, embryo sac, rachis, root tip, rind). The user also decides what percentage representation is relevant. Once the choices have been made, the search is executed locally and a browsable output is displayed. The output can be viewed on-screen or exported as summary tables or sequence files. HarvEST also contains hyperlinks to other sequence databases and facilitates connection to NCBI for live BLAST searches of the public dbEST database.

    Please check this site periodically for new versions. Subsequent versions will respond to user feedback, and contain updated information and new functions, including web-operability. If you would like to be automatically notified of new versions, please send a message asking that you are placed on a notification list.

HarvEST:Cowpea

Version 1.05 of "HarvEST:Cowpea" displays 17 libraries from Vigna unguiculata. ESTs sequenced at the US Department of Energy Joint Genome Institute (141,050 ESTs), the J Craig Venter Institute (41,505 ESTs) and the University of California Riverside (488 ESTs) have been derived from chromatograms using the full HarvEST pipeline. These ESTs, from 12 libraries produced at UC Riverside and 2 at the International Institute for Tropical Agriculture, retain their quality values and can be viewed more extensively than the remaining 75 ESTs, which were downloaded from the GenBank dbEST database by Steve Wanamaker at UC Riverside. We are eager to receive cowpea EST chromatograms from all sources and would be pleased to produce assemblies that include all existing cowpea ESTs. HarvEST:Cowpea contains best BLASTX hits from UniProt (April 2008), the Arabidopsis genome (TAIR version 8; April 2008) and the Medicago truncatula genome (IMGAG version 2; August 2007). Initial development of HarvEST:Cowpea was funded by the USDA Plant Genome program; ongoing development is currently supported by the CGIAR Generation Challenge Program (http://www.generationcp.org/index.php).

HarvEST:Barley

    Version 1.68 of "HarvEST:Barley" contains five assemblies (21, 25, 31, 32 and 35). Assemblies #21 and #25 are very closely related to the "Barley1" microarray produced by Affymetrix as part of the USDA-IFAFS project, "An Integrated Physical and Expression Map of Barley for Triticeae Improvement". Assembly #31 was used for overgo probe design to screen a Morex BAC library for gene-bearing clones in the NSF Plant Genome Research project, "Coupling EST Sequences and BAC Resources to Access the Barley Genome". Assemblies #32 and #35 were the main sources of SNPs for Illumina oligonucleotide pool assays developed initially for the same NSF Plant Genome Research project and subsequently the USDA-CSREES Barley Coordinated Agricultural Project (BarleyCAP; www.barleycap.org). Version 1.68 includes a barley genetic map and rice synteny viewer including 2943 barley SNP loci; the map viewer is functional when assembly #32 or #35 is active. Version 1.68 (562 MB download, 892 installed) also is enabled with special features for viewing the Affymetrix "Barley1" microarray content, including probe set location, probe sequences, and enhanced probe set annotations. Version 1.68 contains barley EST data sets of more than 30,000 each from USDA-funded projects in the US (Wing et al.), IPK Gatersleben (Andreas Graner et al.), Okayama University and the National Institute of Genetics (Kaz Sato et al.), Scottish Crop Research Institute (Robbie Waugh et al.), and University of Helsinki (Alan Schulman et al.), as well as smaller (less than 3,000 each) datasets of barley ESTs, whole cDNAs and genomics sequences from several other contributors. HarvEST:Barley contains best BLASTX hits from the UniProt database, and from the annotated rice (TIGR version 5; February 2007) and Arabidopsis (TAIR version 8; April 2008) genomes. Development of HarvEST:Barley has been funded by the USDA Plant Genome program and the NSF Plant Genome Research Program.

Click here to see more about the relationship of HarvEST:Barley and the Affymetrix Barley1 GeneChip including a powerful probe set annotation tool available in HarvEST:Barley.

HarvEST:Citrus

   Version 1.20 of "HarvEST:Citrus" displays 89 libraries and 229,570 ESTs from Citrus and Poncirus.  ESTs from 16 libraries produced by University of California Riverside projects (Timothy Close & Mikeal Roose), ten by University of California Davis (Abhaya Dandekar), five by USDA/ARS US Horticultural Research Lab in Ft. Pierce, Florida (Robert Shatters, Michael Bausher, Jose Chaparro, Greg McCollum) and two from Volcani Center, Israel (Avi Sadka) have been derived from chromatograms using the full HarvEST pipeline. These ~140,000 ESTs retain their phred quality values and therefore can be viewed more extensively than other sequences in HarvEST:Citrus. All other sequences were downloaded as flat files from GenBank nr or dbEST. In general, the dbEST flat file sequences have been truncated to attempt to include only high quality sequence regions in HarvEST:Citrus. Sequences derived from GenBank flat files are from Universidad Politecnica de Valencia, Valencia, Spain (~71,000 ESTs); USDA/ARS US Horticultural Research Lab in Ft. Pierce (~11,000 ESTs); East Tennessee State University (~3,000 ESTs); Laboratory of Biotechnology & Citrus Genome Analysis Team, Shizuoka, Japan (~2,500 ESTs); smaller numbers of ESTs from several other contributors; and cDNA or genomic sequences from the GenBank nr database, including 40 microbial pathogen sequences. HarvEST:Citrus contains best BLASTX hits from UniProt (January 2007) the Arabidopsis genome (TAIR version 7; April 2007) and the poplar genome (JGI version 1.1). Initial development of HarvEST:Citrus was supported by funding from the USDACSREES Plant Genome program; further development has been supported by the California Citrus Research Board and the University of California Discovery Grant Program.

HarvEST:RiceChip

  Version 1.09 of "HarvEST:RiceChip"  utilizes the same probe set display and annotation functions as HarvEST:Barley, but includes only the assembly that was produced by Affymetrix for the rice GeneChip®. HarvEST:RiceChip contains best BLASTX hits from UniProt (January 2007), BLASTN from rice (TIGR version 5; February 2006), and BLASTX from Arabidopsis (TAIR version 7; April 2007) genomes. Development of HarvEST:Rice was funded by the USDA Plant Genome program.

HarvEST:SoyChip

Version 1.04 of "HarvEST:SoyChip" utilizes the same probe set display and annotation functions as HarvEST:Barley, but includes only the assembly that was produced by Affymetrix for the soybean GeneChip® content design. HarvEST:SoyChip contains best BLASTX hits from UniProt (January 2007), the  Arabidopsis genome (TAIR version 7; April 2007) and the Medicago truncatula genome (IMGAG; January 2007). Initial development of HarvEST:SoyChip was supported by the USDA Plant Genome program; updates are supported by the University of California Agricultural Experiment Station.

HarvEST:Wheat

    Version 1.14 of "HarvEST:Wheat" displays about 101,000 wheat and other Triticeae ESTs produced mainly by a NSF-sponsored wheat project.  HarvEST:Wheat contains four different assemblies and has the same functions HarvEST:Barley, including viewing functions for the Affymetrix wheat GeneChip®. A special version of HarvEST:Wheat called "HarvEST:WheatChip" utilizes the same probe set display and annotation functions as HarvEST:Barley and HarvEST:Wheat, but includes only the assembly that was produced by Affymetrix for wheat GeneChip® content design. Development of HarvEST:Wheat and HarvEST:WheatChip was funded by the USDA Plant Genome program.

HarvEST:Brachypodium

Version 0.52 of "HarvEST:Brachypodium" displays 6 libraries from Brachypodium distachyon. All sequences were downloaded from the GenBank dbEST database by Steve Wanamaker at UC Riverside.  HarvEST:Brachypodium contains best BLASTX hits from UniProt (January 2007) and the rice (TIGR version 5; February 2007) and Arabidopsis (TAIR version 7; April 2007) genomes. Initial development of HarvEST:Brachypodium was funded by the USDA Plant Genome program.

HarvEST:Coffea

Version 0.13 of "HarvEST:Coffea" displays 12 libraries from Coffea arabica, Coffea canephora or an interspecies hybrid. All sequences with quality values were received by Steve Wanamaker at UC Riverside from the Tanksley lab at Cornell University or downloaded as flat files from the GenBank dbEST database. HarvEST:Coffea contains best BLASTX hits from UniProt (January 2007) and the  Arabidopsis genome (TAIR version 7; April 2007). Initial development of HarvEST:Coffea was funded by the USDA Plant Genome program.

HarvEST:other

    We can apply the copyrighted HarvEST software to EST assemblies from any organism.  Click here to see the file requirements if you have an interest in working with us to do this.

HarvEST News

Under Development:

bullet Larger assemblies
bullet Global SNP finder
bullet Links to BAC clones and contigs
bullet Links to OligoSpawn overgo probe designer and database

Most Recent Releases:

June 25, 2008 HarvEST:Cowpea version 1.05 released. It has the following features:

bullet Sequence alignment viewer - to navigate within the CAP3 sequence alignments and view SNP's
bullet Two assemblies
bullet 183,118 cowpea ESTs
bullet Best BLASTX of UniProt (April 2008), Arabidopsis (TAIR version 8; April 2008) and Medicago truncatula (IMGAG version 12; January 2008) genomes, with hyperlinks
bullet 159 MB download, 231 MB installed
bullet Search by expression pattern
bullet Extensive "Output Unigene" and other export functions

June 25, 2008 HarvEST:Barley version 1.68 released.  It has the following features:

bullet Barley genetic map viewer (active from assemblies #32 and #35) including 2943 mapped SNP loci, including rice synteny view
bullet Batch export of genetic map coordinates, marker names, mapped unigene sequences with annotations
bullet Support of the Affymetrix "Barley1" chip: export of probe set annotations, graphical displays of probes on unigenes, other "Search the Barley Chip" functions
bullet Sequence alignment viewer - to navigate within the CAP3 sequence alignments and find SNP's
bullet 444,652 barley ESTs and about 1100 other barley sequences
bullet Best BLASTX of UniProt (January 2007), rice (TIGR version 5; February 2007) and Arabidopsis (TAIR version 8; April 2008) genomes, with hyperlinks
bullet Cross-references unigenes between different assemblies
bullet 562 MB download, 892 MB installed
bullet Search by expression pattern
bullet Extensive "Output Unigene" and other export functions

February 19, 2008 HarvEST:RiceChip version 1.09

January 4, 2008  HarvEST:Citrus version 1.20 released.  It has the following features:

bullet Sequence alignment viewer - to navigate within the CAP3 sequence alignments and find SNPs
bullet 229,570 sequences, including quality values for ~140,000 ESTs
bullet Best BLASTX of UniProt (January 2007), Arabidopsis (TAIR; April 2007) and poplar (JGI; version 1.1) with hyperlinks
bullet Arabidopsis and poplar map displays
bullet Contains the two "all citrus" assemblies (C37 & C38) used for the Affymetrix Citrus GeneChip®
bullet Support of the Affymetrix Citrus GeneChip®: exports up-to-date probe set annotations, graphical displays of probes on unigenes, other "Search the Citrus Chip" functions
bullet Displays Affymetrix Citrus GeneChip® probe positions and other probe details
bullet Six assemblies in total: five combining all citrus species, one with only Poncirus trifoliata (37,481), one with only Citrus sinensis (98,176)
bullet Cross-references unigenes between different assemblies
bullet 309 MB download, 555 MB installed
bullet Search by expression pattern
bullet Extensive "Output Unigene" and other export functions

November 26, 2007  HarvEST:Wheat version 1.14 released.  It has the following features:

bullet Support of the Affymetrix Wheat chip: exports up-to-date probe set annotations, graphical displays of probe sets on unigenes, other "Search the Wheat Chip" functions
bullet Sequence alignment viewer - to navigate within the CAP3 sequence alignments
bullet Four assemblies: two containing 101,107 Triticum, Aegilops and Secale ESTs; two containing 91,130 Triticum ESTs
bullet Best BLASTX of UniProt (January 2007), rice (TIGR version 5; February 2007) and Arabidopsis (TAIR version 7; April 2007) genomes, with hyperlinks
bullet 186 MB download, 330 MB installed
bullet Cross-references unigenes between different assemblies
bullet Search by expression pattern
bullet Extensive "Output Unigene" and other export functions

November 2, 2007 HarvEST:SoyChip version 1.04

November 2, 2007 HarvEST:Coffea version 0.13

November 2, 2007  HarvEST:WheatChip version 1.53 released.  It has the following features:

bullet Support of the Affymetrix Wheat chip: exports up-to-date probe set annotations, graphical displays of probe sets on unigenes, other "Search the Wheat Chip" functions
bullet Sequence alignment viewer - to navigate within the CAP3 sequence alignments
bullet Contains the assembly created and used by Affymetrix to design the wheat chip
bullet Best BLASTX of UniProt (January 2007), rice (TIGR version 5; February 2007) and Arabidopsis (TAIR version 7; April 2007) genomes, with hyperlinks
bullet 161 MB download, 414 MB installed
bullet "Output Unigene" and other export functions

May 1, 2007 HarvEST:Brachypodium version 0.52

Screen Shots

 Click Here to View HarvEST Screen Shots

Using HarvEST

Searches

  Search ESTs by Expression Pattern

Select libraries in which you wish to see EST’s by entering, in the Min % field above the "Include" column, a minimum threshold percentage (example 0.2%), and putting a check mark next to each desired library.

Select the libraries from which you wish to exclude EST’s by entering, in the Max % field above the "exclude column", a maximum threshold percentage (example 0.04%), and putting a check mark next to each of those libraries.

Search ESTs by best BLAST hit keyword

You may find ESTs in the database by searching by function using keywords.

Search by Genbank #, EST Name, or Unigene #

You may find ESTs by Genbank #, EST name, or HarvEST unigene #.  Note: HarvEST unigene numbers change between versions and differ between assemblies.

Search the Chip (Barley, Citrus, Rice, Soybean, Wheat)

You may input single probeset names or browse to a list of probe set names. You may output annotations or view the details of the unigenes. To generate annotations you may decide how many probes in a probe set must match an annotated unigene to absorb the annotation. The highest blastx score from any unigene touched by the probe set is reported.

Results Display

bullet

The unigene(s) selected through one of the above searches are shown on the left side.  The right side has alternative views, controlled by selections on the upper right.  The "Distribution Among Libraries" view displays where each member of the unigene was found within the libraries.   For each library, the number of unigene members, percentage of library, and a bar graph of the percentage of library are shown.  The "Alignment" view shows  the position and orientation of each EST within each CAP3 contig.  The "Sequence Alignment" view shows the full sequence information, including colorized representation of phred quality values and positions of deviation from the consensus sequence.

bullet

The best BLASTX hit against the translated NCBI nr database for the highest-scoring sequence in the selected contig is shown at the bottom left, and against the TIGR annotated rice (IRGSP) and/or Arabidopsis genome in the adjacent window(s).

bullet

Scrolling through the list of unigenes causes the display of libraries on the right to change to reflect the distribution of the each unigene that is highlighted.

bullet

Click on a library name to view a detailed description sheet for a library.

bullet

Click the View Members of Selected Unigene button for a browse-able list of the ESTs in the highlighted unigene.

bullet

Click the Output all Sequences from Selected Unigene button to export the highlighted unigene to a FASTA-formatted text file.

bullet

Click the View/Blast Consensus of Selected Unigene button to perform an NCBI BLASTX-nr search. (This requires an Internet connection.)

bullet

Click the Output the Above Unigene List button to create a FASTA or tab-delimited text file.

bullet

Click the Create a Unigene/Assembly Cross-Reference button to generate a tab-delimited text file.

Select a Different Assembly

bullet

HarvEST contains multiple assemblies.  Select this option to switch between them.  All other displays are in the context of the selected assembly.

Print Reports

bullet

The "Library Summary" shows the libraries are in the assembly and how many clones, contigs, contigs unique to the library, and singletons for each library.  The "Orientation Calls by Library Report" shows the number of forward and reverse orientation reads from each library.  Orientations are determined by a combination of sequencing primer information, presence of polyT or polyA, and best BLASTX orientation. The "Print a Summary by Source" lists the number of ESTs from each person, lab, or group.  These numbers are generally a little higher than reported elsewhere in HarvEST since some types of hidden ESTs that are in an intermediate processing stage may be counted.

About HarvEST Assemblies

    The assemblies in HarvEST:Citrus, HarvEST:Wheat and HarvEST:Barley were made by Steve Wanamaker using CAP3. The sequence clusters in these assemblies are not identical to clusters created by other programs. The unigene numbers are different for each assembly and do not correspond to unigene numbers in any other assembly or database. The assemblies in the HarvEST:Affymetrix versions for Wheat, Rice and Soybean were produced by Affymetrix and have been included in HarvEST software with permission of Affymetrix. HarvEST:Barley assembly #21 was the source of barley content for the Affymetrix barley genome array. HarvEST:Citrus assemblies #37 and #38 were the sources of citrus content for the Affymetrix citrus genome array.

System Recommendations & Requirements

bullet

Windows 95/98/Me, NT4/2000/XP/Vista (Windows is REQUIRED, for online function go to www.harvest-web.org)

bullet

1 GHz or higher processor recommended

bullet

1 GB free hard disk space is REQUIRED for HarvEST:Barley

bullet

512 MB RAM minimum recommended

bullet

1024 x 768 video resolution is REQUIRED

bullet

Internet connection (to use hyperlinks to other databases from HarvEST - optional, but nice)

Download the Most Recent Version

Click Here To Download HarvEST:Barley version 1.68 (6/25/2008)  562 MB download, 892 MB installed. This contains five different assemblies. See License Agreement (below) and System Requirements (above).

Click Here To Download HarvEST:Brachypodium version 0.52 (5/1/2007) 21 MB download, 30 MB installed. This contains two assemblies. See License Agreement (below) and System Requirements (above).

Click Here To Download HarvEST:Citrus version 1.20 (1/4/2008)  309 MB download, 555 MB installed. This contains six different assemblies. See License Agreement (below) and System Requirements (above).

Click Here To Download HarvEST:Coffee version 0.13 (11/2/2007) 55 MB download, 75 MB installed. This contains two assemblies. See License Agreement (below) and System Requirements (above).

Click Here To Download HarvEST:Cowpea version 1.05 (6/25/08) 159 MB download, 231 MB installed. This contains two assemblies. See License Agreement (below) and System Requirements (above).

Click Here To Download HarvEST:RiceChip version 1.09 (2/19/2008) 144 MB download, 320 MB installed. This contains one assembly. See License Agreement (below) and System Requirements (above).

Click Here To Download HarvEST:SoyChip version 1.04 (11/2/2007) 126 MB download, 346 MB installed. This contains one assembly. See License Agreement (below) and System Requirements (above).

Click Here To Download HarvEST:Wheat version 1.14 (11/26/2007) 186 MB download, 330 MB installed. This contains four different assemblies. See License Agreement (below) and System Requirements (above).

Click Here To Download HarvEST:WheatChip version 1.53 (11/2/2007) 161 MB download, 414 MB installed. This contains one assembly. See License Agreement (below) and System Requirements (above).

To Install

Download the H*.exe file to a local hard drive. Double click the H*.exe file to launch the Windows installer. The installation directory defaults to a sub-directory off of C:\HarvEST. You may select an alternate location if you wish. The different HarvEST programs can be installed without conflicting with each other. The Visual FoxPro runtime libraries will be installed and the executable and data files will be de-compressed to the installation directory.

To Upgrade to a New Version

Un-install the previous version through Windows (see below), then install the new version.

To Un-Install

Go to Start, Settings, Control Panel, Add/Remove programs, Click on HarvEST, click Remove. Any residual directory, contents or desktop icon can then be manually deleted. A separate uninstallation through Windows must be performed for each HarvEST software.

Troubleshooting

If you received the error message: "Error 2355 during installation:  The setup file HarvESTxxx.exe is damaged.", then you will need to re-download the file.

If you receive the following error message:

PROGRAM: FORM_HARVEST.LOAD

LINE.........: 54

ERROR # ..: 1961 A subdirectory or file c:\temp\ already exists.

Then you will need to delete the file or folder c:\temp.  HarvEST will re-create it properly.  Click My Computer, C-drive, find the temp file or folder.  Right-click on it. Click delete.

HarvEST License Agreement

To receive and use this program you must agree to the following conditions:

1. This software is available free of charge for academic purposes.  Contact us for a price quote if you do not make your data publicly available.

2. You may re-distribute HarvEST software as long as:
        a. You do not charge for it. 
        b. You do not alter it in any way.
        c. You do not include it as part of software that you charge for.

3. You agree to give us feedback concerning program performance, ease of use, the utility of various features, and any errors found.

Contacts for questions, bug reports, suggestions:

               Steve Wanamaker, steve.wanamaker@ucr.edu

               Timothy Close,
timothy.close@ucr.edu


Copyright Message

HarvEST software and this web page copyrights (C) 2001 - 2008 Steve Wanamaker, Timothy Close and University of California.  All Rights Reserved.