Link to Home

<<
 

1. Introduction to GenomeComp

Chapter Contents
1.1 What is GenomeComp?
1.2 System Requirements
1.3 Getting and Installing GenomeComp
1.3.1 Installation Instructions for Unix/Linux
1.3.2 Installation Instructions for Mac OS X
1.3.3 Installation Instructions for Windows

 

What is GenomeComp?

GenomeComp is a DNA sequence comparison tool and graphical user interface (GUI) viewer implemented in Perl/Tk. It can read one, two or three input sequences in either of FASTA, GENBANK or EMBL format, then it runs the external program BLASTN or MegaBlast to do the comparison between paired sequences in background. If the results already exist, it can skip that step and read the output file directly. Then the dynamic display of the results will show in the canvas of the main window.

GenomeComp can easily manage genome wide sequences (about several mega base pairs), so as to locate the rearrangements, insertions or deletions of genome segments between species or strains. In addition, it can show other features like repeat sequence distribution in one whole-genome sequence by self-comprison.

System Requirements

GenomeComp will run on any Unix/Linux, Mac OS X (tested on version 10.2.4 only) or Windows system ) that has a recent version of Perl with Tk module installed. For Unix/Linux systems Perl version 5.6 (or higher) and Tk version 800.022 (or higher) is recommend. While for MicroSoft Windows systems ActivePerl 521 (or higher) and Tk800.022 (or higher) is recommend.

Most of the orginal development has been done using Perl 5.004 with Tk 800.022 on an IRIX 6.5 system. The second release was developped mainly on a RedHat Linux 8.0 system with Perl 5.8 and Tk 804.024. Also RedHat Linux 7.0 system with Perl 5.6.0 and Tk 800.015, and Windows 98 and 2000 professional system with ActivePerl 521 and Tk 800.022 have been tested successful with the first release. However, the second release require at lease Tk800.022 to run!

New: The binary distributions of this program, which have no any system requirements, are available now! (Please visit GenomeComp home page for details.)

In comparison of microbial genomes (several million base pairs in length), we have not found technical difficulty as long as sufficient system memory is available. To the current release, the average memory cost for comparing general annotated genome sequences (i.e. GenBank or EMBL format) is about 5Mb/Mbp/sequence. For example, the whole genome comparison between E. coli K12 and S. flexneri 2a 301 (both are about 4.7Mbp) used nearly 50Mb memories in my Linux when testing.

The comparison speed should mainly determined by the CPU speed of running system besides the size of the input sequences. On my Linux (Intel P4 1.5G), the above mentioned comparison needed only about half minute with an existing BLAST output file.

Getting and Installing GenomeComp

The most up to date version of GenomeComp is always available from the GenomeComp home page.

Please skip the following contents of this part if you are using the binary distribution of this program. For source code distribution users, before installing GenomeComp you should make sure that your system have Perl and Tk module installed.

Unix/Linux systems usually have Perl installed when the systems were setup. If that, users can easily get it's version information by typing "perl -v" on command line (Of course Windows users also can use this method to test if Perl has been installed). Otherwise, go to the web page of Perl Ports (Binary Distributions) from CPAN, download and install the suitable binary version of Perl according to your system type. And the Tk module is available also on the CPAN site by searching the distribution called Tk or from the author's home page.

Windows users also can get the suitable distributions of Perl and Tk from CPAN's web site just like the above. But the more convenient and recommended way is to go directly to the ActiveState web site to get the recent version of ActivePerl for MSWin32 from the ActivePerl download page and the corresponding Tk module from the PPM Zip Archieves download page.

Installation Instructions for Unix/Linux

Change directory to where you wish to install GenomeComp in.

1) Unpack the archive. ("XXX" indicates your system type for packages of binary distributions)

> gzip -d < GenomeComp1.3[_XXX].tar.gz | tar xvf -

This will create a sub-directory called "GenomeComp1.3" in your current directory which will contain all the necessary files for running GenomeComp.

2) Set the environment parameter file for GenomeComp. (Note: this step is set optional now! Users can always using the program properly without this setting as long as keeping the integrating of this directory.)

Move into the directory called GenomeComp1.3 and edit the file called "ibprc" in this directory and change the line "Images=/yourpath/GenomeComp1.3/images" to the corresponding directory of the image files for GenomeComp on your machine. Then copy this file to your home direstory as ".ibprc" file.

>cp ibprc ~/.ibprc

3) Make the program executable (skip this step if it was executable already).

> chmod 755 GenomeComp1.3[_XXX]

4) (Optional) Link (don't move!) this program file to anywhere as you like (Such as into your system PATH), so you may run it by type "GenomeComp1.3[_XXX]" in the command line anywhere.

>ln -s /yourpath/GenomeComp1.3/GenomeComp1.3[_XXX] /where_you_want/

Installation Instructions for Mac OS X

To use GenomeComp on Mac OS X, one needs to install an X Windows manager. Apple has one available for free here:

http://www.apple.com/macosx/x11/download/

Then the latest version of Perl and Tk need to be installed. Instructions can be found here:

http://www.lehigh.edu/~sol0/Macintosh/X/ptk/

After that, the instructions should be the same as the regular Unix instructions in the above.

Installation Instructions for Windows

1) Unpack the archive by WinZip or other compress tools, extract the folder "GenomeComp1.3" to anywhere you wish (better in your system programs folder such as "C:\Program File").

2) (Optional) Create a shortcut link to the main program file in any convenient place of your system such as your Desktop. So you can call the program any time by double-clicking on the shortcut icon.

 
<<

Copyright @2002 Chinese National Human Genome Center,Beijing
All Rights Reserved