Bioinformatic Resources and BAC resources: Difference between pages

From EchinoWiki
(Difference between pages)
Jump to navigation Jump to search
imported>Ctelmer
m (Ctelmer moved page Bioinformatic resources to Bioinformatic Resources: There was not a link to the Main Page.)
 
imported>Echinobase
 
Line 1: Line 1:


== Genomic BAC Resources ==


===== Our Favorites =====
Echinobase stores libraries of genomic sequence fragments maintained in bacterial artificial chromosome (BAC) vectors for seven lower deuterostome species [[Echinobase/bac_table/bac_table.php|'''(BAC Table)''']]. These libraries were originally constructed as part of the Sea Urchin Genome Project. On average, the insert sizes for our BAC libraries are ~140 kb. Thus, for the 800 Mb ''S. purpuratus'' genome, 1X coverage is encompassed by, on average, 5,700 clones. Each library consists of >100,000 clones, providing ~17X genome coverage.


[http://blast.ncbi.nlm.nih.gov/Blast.cgi NCBI BLAST]
A subset of the ''S. purpuratus'' BACs have been mapped to the genome assembly. As part of the SUGP, about 8,000 BAC clones were end sequenced using Sanger technology. Genomic positions for these BACs are indicated as a track in JBrowse (link). An additional set of individual BACs were sequenced completely; links to the NCBI page is provided for these. Finally, some BACs have been identified using hybridization screening of arrayed library filters (see [[Echinobase/bac_table/bac_table.php|'''(BAC Table)''']])


[http://emboss.sourceforge.net/ EMBOSS]
'''BAC Vector Sequence'''. The BACs are maintained in the [http://bacpac.chori.org/pbace36.htm pBACe3.6 vector] which originates from Children's Hospital Oakland Research Institute in Oakland, California, USA. This vector confers resistance to the antibiotic chloramphenicol and is fully described by [http://www.ncbi.nlm.nih.gov/pubmed/10373322?dopt=Abstract Frengen and colleages]. The vector sequence is available [http://www.ncbi.nlm.nih.gov/nuccore/4878025?report=fasta here].


[http://hmmer.org/ HMMER]
'''Relevant publications:'''<br />
[http://www.pnas.org/content/97/17/9514.full A sea urchin genome project: Sequence scan, virtual map, and additional resources.] Cameron, RA, et al., 2000. Proc. Natl. Acad. Sci. USA.<br />
[[LINK. Bacterial artificial chromosomes as recombinant reporter constructs to investigate gene expression and regulation in echinoderms.]] Buckley, KM et al., 2017. Briefings in Functional Genomics.


[http://www.ncbi.nlm.nih.gov/spidey/ SPIDEY] :mRNA to genomic alignment


===== Multiple Alignment =====


[http://www.ebi.ac.uk/Tools/msa/clustalw2/ ClustalW2]
== BAC Protocols ==


[http://www.tcoffee.org/ T_Coffee]
Screening a BAC library<br />
Generating a recombinant BAC<br />
BAC miniprep protocol<br />
Analyzing BAC DNA by PFGE


[http://doua.prabi.fr/software/seaview Seaview:alignment editor]
<br />


===== Phylogenetic Analysis =====


[http://evolution.genetics.washington.edu/phylip.html Phylip 3.66:phylogenetic analysis]
== Recombinant BAC Resources ==


* Note: be sure to install libxaw7 and dev dbg headers packages before<br />
Echinobase has generated and maintains a set of reporter BAC constructs in which a portion of the gene of interest is replaced by the sequence encoding a fluorescent protein (''i.e.'' GFP, mCherry). A list of these sequences is available [[Echinobase/bac_table/bac_table.php|'''here''']]). Existing clones are available by request. We will also consider requests to make additional recombinant reporter constructs as time permits.
compile 3.66;
* Note: This package is compiled from the latest version, not the EMBOSS<br />
integrated phylip 3.6, which can be done by './configure --prefix=/usr/local'; not debian phylip 3.6.1.


Molphy 2.3b: Maximum Likelihood.
'''To request a recombinant or wildtype BAC for non-commercial use:'''<br />
1. Identify the BAC of interest computationally or by screening our libraries ([[Echinobase/filters|available filters]]).<br />
2. Request the BAC by emailing [mailto:kbuckle2@andrew.cmu.edu Kate Buckley] or [mailto:veronica@cmu.edu Veronica Hinman].


Tree-Puzzle 5.2: Maximum Likelihood.
<br />


[http://doua.prabi.fr/software/njplot NJplot tree drawing]


===== Gene Prediction =====
[[File:Bac.jpg|600px]]


[http://genes.mit.edu/GENSCAN.html GENSCAN]<br />
== Working with BACs ==
Note: The document can be found in /usr/local/share/genescan. Three trained models are available: HumanIso.smat, Arabidopsis.smat, Maize.smat.My personal experience is that Humalso is much better than other two for sea urchin.example: genscan /usr/local/share/genscan/HumanIso.smat mycontig.fasta


[https://ccb.jhu.edu/software/glimmerhmm/ GlimmerHMM, 2.0.4]<br />
'''Storing BACs'''
Note: The document can be found in /usr/local/share/GlimmerHMM. Two trained models are available: arabidopsis and rice. Not good for sea urchin. But user can train their own model. example: glimmerhmm<br />
Contig600303.fasta –d /usr/local/share/GlimmerHMM/trained_dir/arabidopsis/


[http://www.genezilla.org/ GeneZilla, 1.0, former TigerScan]<br />
'''Short term'''<br />
Note: Only one model trained for human available now(/usr/local/share/genezilla). User has to go genezilla directory to run the program. example: ./genezilla human.iso ~/mycontig.fasta
To avoid shearing the long BAC DNA fragments, ''isolated BACs should never be vortexed or frozen''. Store BAC DNA at 4 C in TE (10 mM Tris, pH = 7.4; 10 mM EDTA).<br />
'''Long term storage'''<br />
Store glycerol bacteria stocks (15-25% glycerol) at -80 C.


===== Other bioinformatics software =====
'''Antibiotic conditions for working with BAC clones'''<br />
 
'''Kanamycin'''<br />
[http://bioinfo.ut.ee/primer3-0.4.0/ Primer 3, Release 1.0]
Stock (25 μg/μL): Dissolve 0.5 g of kanamycin into 10 ml of ddH2O. Filter through a 0.22 µm filter to sterilize. Aliquot and store at -20°C.<br />
 
Use at 1:1000 dilution (25 μg/mL).<br />
[http://www.sanger.ac.uk/science/tools/artemis Artemis 7]:a DNA sequence viewer and annotation tool. Use 'art' to launch the artemis; use 'act' to launch ACT, a DNA sequence comparison viewer.
'''Chloramphenicol'''<br />
 
Stock (25 μg/μl): Dissolve in ETOH store at -20.<br />
[http://bioinformatics.org/cd-hit/ CD-HIT, v3.0.2(2006-0411)] :CD-HI/CD-HIT clusters protein sequence database at high sequence identity threshold. This program can remove the high sequence redundancy<br />
Use at 1:2000 dilution.(12.5 μg/ml)
efficiently.
 
[http://www.tbi.univie.ac.at/RNA/ ViennaRNA] : RNA Secondary Structure Prediction and Comparison.
 
[http://www.repeatmasker.org/ RepeatMasker 3.2.8, with RepBase 20090604]
 
===== Other Scientific Software =====
 
* R 2.9
* Gnuplot 4.2
* Grace 5.1.22
 
===== Programming Resources =====
 
* Python 2.3.5, BioPython 1.41
* Perl 5.16, BioPerl 1.4
* GCC 4.0.3
* J2SE 1.5
 
 
 
Credit: Original list compiled by Dr. Qiang Tu<br/>
Note: This page has been flagged as requiring an update as some software versions are now out of date.

Revision as of 18:31, 12 February 2020

Genomic BAC Resources

Echinobase stores libraries of genomic sequence fragments maintained in bacterial artificial chromosome (BAC) vectors for seven lower deuterostome species (BAC Table). These libraries were originally constructed as part of the Sea Urchin Genome Project. On average, the insert sizes for our BAC libraries are ~140 kb. Thus, for the 800 Mb S. purpuratus genome, 1X coverage is encompassed by, on average, 5,700 clones. Each library consists of >100,000 clones, providing ~17X genome coverage.

A subset of the S. purpuratus BACs have been mapped to the genome assembly. As part of the SUGP, about 8,000 BAC clones were end sequenced using Sanger technology. Genomic positions for these BACs are indicated as a track in JBrowse (link). An additional set of individual BACs were sequenced completely; links to the NCBI page is provided for these. Finally, some BACs have been identified using hybridization screening of arrayed library filters (see (BAC Table))

BAC Vector Sequence. The BACs are maintained in the pBACe3.6 vector which originates from Children's Hospital Oakland Research Institute in Oakland, California, USA. This vector confers resistance to the antibiotic chloramphenicol and is fully described by Frengen and colleages. The vector sequence is available here.

Relevant publications:
A sea urchin genome project: Sequence scan, virtual map, and additional resources. Cameron, RA, et al., 2000. Proc. Natl. Acad. Sci. USA.
LINK. Bacterial artificial chromosomes as recombinant reporter constructs to investigate gene expression and regulation in echinoderms. Buckley, KM et al., 2017. Briefings in Functional Genomics.


BAC Protocols

Screening a BAC library
Generating a recombinant BAC
BAC miniprep protocol
Analyzing BAC DNA by PFGE



Recombinant BAC Resources

Echinobase has generated and maintains a set of reporter BAC constructs in which a portion of the gene of interest is replaced by the sequence encoding a fluorescent protein (i.e. GFP, mCherry). A list of these sequences is available here). Existing clones are available by request. We will also consider requests to make additional recombinant reporter constructs as time permits.

To request a recombinant or wildtype BAC for non-commercial use:
1. Identify the BAC of interest computationally or by screening our libraries (available filters).
2. Request the BAC by emailing Kate Buckley or Veronica Hinman.



Working with BACs

Storing BACs

Short term
To avoid shearing the long BAC DNA fragments, isolated BACs should never be vortexed or frozen. Store BAC DNA at 4 C in TE (10 mM Tris, pH = 7.4; 10 mM EDTA).
Long term storage
Store glycerol bacteria stocks (15-25% glycerol) at -80 C.

Antibiotic conditions for working with BAC clones
Kanamycin
Stock (25 μg/μL): Dissolve 0.5 g of kanamycin into 10 ml of ddH2O. Filter through a 0.22 µm filter to sterilize. Aliquot and store at -20°C.
Use at 1:1000 dilution (25 μg/mL).
Chloramphenicol
Stock (25 μg/μl): Dissolve in ETOH store at -20.
Use at 1:2000 dilution.(12.5 μg/ml)