NMR vs. Xtal issues: resolution, biological conformation, sample preparation
How much detail do we retain?
residues
C-alpha trace
all atoms (most typical for structural databases)
How do we represent structural information?
labeled graph: atom types + positions + connections
minimal information: atom types + positions + rules for making connections
What does experimental data provide?
What about fluctuating structures?
PDB www.rcsb.org the Protein Data Bank, now at Rutgers
PDB file information
MMDB http://www.ncbi.nlm.nih.gov/Structure/
Sequence neighbors vs. structure neighbors. Divergent vs. convergent evolution.
Viewing structures and structure neighbors
Assembly algorithm
Example: building an assembly from ESTs
Take this sequence into CuraTools:
1 cgagttcgtc aacgccgctt tcaacgtgac tgtggtggcc acaacacgtg tgggactccg 61 cccgaggaat actgtgtgca gaccggggtg accgggtcac aagtcctgtc acctgtgcga 121 cgccgggcag ccccacctgc agcacagggc agccttcctg accgactaca acaaccaggc 181 cgacaccacc tggtggcaaa gcagagccat gctBlast against DB Est to build a cluster
Automated iteration: The Est Extractor at http://hercules.tigem.it/BLASTEXTRACT/estextract.html
1 tcaggagcca gccccaccct tagaaaagat gttttccatg aggatcgtct gcctggtcct 61 aagtgtggtg ggcacagcat ggactgcaga tagtggtgaa ggtgactttc tagctgangg 121 aggaggcgtg cgtggcccaa gggttgtgga aagacatcaa tctgcctgca aagattcaga 181 ctggcccttc tgctctgatg aagactggaa ctacaaatgc ccttctggct gcaggatgaa 241 aagggttgat tgatgaagtc aatcaagatt ttacaaacag aataaataag ctcaaaaatt 301 cactatttga atatcagaag ancaataagg attctcattc gttgaccact aatataatgg 361 gaaattttga gaggcgattt ttcctcagcc aattaaccgt ggataatacc tacaaccgag 421 tgtccagagg atctgaggan gcaggaattt gaagtcctga agcgcaaagt cataggaaaa 481 gtncagcata tccagcttct ncagaaantg ttaggagctc ngtttggtSee if you can assemble it with other ESTs to build a contig. How long is the contig?
1 aaatttgtca tggatggagg gtatctggat caacccgaca actgtccaga gagagtcact 61 gacctcatgc gcatgtgctg gcaattcaac cccaagatga ggccaacctt cctggagatt 121 gtcaacctgc tcaaggacga cctgcacccc agctttccag aggtgtcgtt cttccacagc 181 gaggagaaca aggctcccga gagtgaggag ctggagatgg agtttgagga catggagaat 241 gtgcccctgg accgttcctc gcactgtcag agggaggagg cggggggccg ggatggaggg 301 tcctcgctgg gtttcaagcg gctacgagga acacatccct tacacacaca tgaacggagg 361 caagaaaaac gggcggattc tgaccttgcc tcggtccaat ccttcctaac agtgcctacc 421 gtggcggggg cgggcagggg ttccattttc gctttcctct ggtttgaaag cctctggaaa 481 actcaggatt ctcacgactc taccatgtcc aatggagttc agagatcgtt cctatacatt 541 tctgttcatc ttaaggtgga ctcgtttggt taccaattta aSee if you can build an EST assembly.