Professional Documents
Culture Documents
BIOLGICOS
Gabriel Dequigiovanni
Departamento de Gentica
gabriel.dequi@gmail.com
1866
1903
1913
1944
1945
1953
1960s
1977
1983
1995
1998
1999
2000
2001
2003
2004
2005
2007
2009
LOUSA
Animao!
http:/www.dnalc.org/ddnalc/resources/sangerseq.html
LASER
Deteco
a laser
Animao!
http:/www.dnalc.org/ddnalc/resources/cycseq.html
454 Roche
fragmentos
Reao de sequenciamento
Eletroforese capilar
Anlise computacional
FRAGMENTOS COMPLETOS
BIOINFORMTICA
A bioinformtica consiste no desenvolvimento de
mtodos computacionais, matemticos e estatsticos
para organizar e analisar informaes biolgicas em
grande escala e de maneira integrada.
Organizao
e Armazenamento
Visualizao
e Anlise
- Ferramentas computacionais
- Compreenso do significado biolgico
Voc toparia?
Neanthertal
James Watson
Craig Venter
430.000 anos
2019
Arroz
Soja
Arabdopsis
Milho
Tomate
Poplar Genome Sequenced and Published;
Model Crop for Biofuels
Organismo-especfico
http://flybase.org/
http://poultry.mph.msu.edu/
http://www.maizegdb.org/
http://rice.plantbiology.msu.edu/
http://www.yeastgenome.org/
http://soybeangenome.siu.edu/
http://www.ornl.gov/sci/techresources/Human_Genome/ho
me.shtml
Sequenciamento de genomas:
http://www.insdc.org/
INSDC
USA
NCBI/NLM
Europe
EBI/EMBL
Atualizaes dirias
Troca de informaes
FERRAMENTAS
ENTREZ: ferramenta
de busca do banco de
dados do NCBI
PubMed: artigos
cientficos
ESTRUTURA DO GENBANK
http://www.ncbi.nlm.nih.gov/Database/index.html
https://www.ncbi.nlm.nih.gov/nuccore/AH003701.2
>gi|226347323|gb|ACO50079.1| ribulose-1,5-bisphosphate
carboxylase/oxygenase large subunit [Anabaena planctonica CENA210]
GEIKGHYLNVTAPTCEEMLKRAEYAKELKMPIIMHDYLTAGFTANTTLARWCRDNGILLHIHRAMHAVID
RQKNHGIHFRVLAKALRLSGGDHIHTGTVVGKLEGERGITMGFVDLLRENYVEQDKSRGIYFTQDWASLP
GVMAVASGGIHVWHMPALVEIFGDDSVLQFGGGTLGHPWGNAPGATANRVALKAVVQARNEGRNLAREGN
DIIREAAKWSPELAVACEL
BUSCA EM BLAST
BLAST: Basic Local Alignment Search Tool
Por sequncia de nucleotdeos ou de aminocidos (protenas)
Comparao de sequncias a fim de identificar similaridade de
DNA ou protena para inferir origem, funo, filogenia
Realiza comparaes entre pares de sequncias, buscando
regies com similaridade local
Alinhamento local (segmentos) a base da busca por BLAST
Usa algoritmos para gerar alinhamento de sequncias
BUSCA EM BLAST
BUSCA EM BLAST
BUSCA EM BLAST
Algoritmos em Blast:
No avaliam homologia
Exemplos:
rgos homlogos asas de morcego e mos de humanos (mesma origem)
rgos similares asas de morcego e asas de borboleta (mesma funo)
BUSCA EM BLAST
Identidade x Similaridade x Homologia
Identidade = ocorrncia do mesmo nucleotdeo
aminocido na mesma posio nas seqncias alinhadas
ou
BUSCA EM BLAST
Nossa sequncia query (consulta),
O resultado da busca em BLAST pode ser um ou mais hits em
sequncias-sujeito (subject)
significativo o alinhamento!!!
Nucleotdeos
GGCTCTTTAGCTTCTTAGGACAGCACTTCCTGATT
TTGTTTTCAACTTCTAATCCTTTGAGTGTTTTTCA
TTCTGCAGATGCTGAGTTTGTGTGTGAACGGACAC
TGAAATATTTTCTAGGTGCGGGAGGAAAATGGGTA
GTTAGCTATTTCTGTAAGTATAATACTATTTCTCC
CCTCCTCCCTTTAACACCTCAGAATTGCATTTTTA
CACCTAACGTTTAACACCTAAGGTTTTTGCTGATG
CTGAGTCTGAGTTACCAAAAGGTCTTTAATTGTAA
TACTAAACTACTTTTATCTTTAATATCACTTTGTT
CAGATAAGCTGGTGATGCTGGGAAAATGGGTCTC
Z96068.1
Protena
>EAX11622.1 lactase [Homo sapiens]
MELSWHVVFIALLSFSCWGSDWESDRNFISTAGPLTNDLLHNLSGLLGDQSSNFVAGDKDMYVCHQPLPT
FLPEYFSSLHASQITHYKVFLSWAQLLPAGSTQNPDEKTVQCYRRLLKALKTARLQPMVILHHQTLPAST
LRRTEAFADLFADYATFAFHSFGDLVGIWFTFSDLEEVIKELPHQESRASQLQTLSDAHRKAYEIYHESY
AFQGGKLSVVLRAEDIPELLLEPPISALAQDTVDFLSLDLSYECQNEASLRQKLSKLQTIEPKVKVFIFN
LKLPDCPSTMKNPASLLFSLFEAINKDQVLTIGFDINEFLSCSSSSKKSMSCSLTGSLALQPDQQQDHET
TDSSPASAYQRVWEAFANQSRAERDAFLQDTFPEGFLWGASTGAFNVEGGWAEGGRGVSIWDPRRPLNTT
EGQATLEVASDSYHKVASDVALLCGLRAQVYKFSISWSRIFPMGHGSSPSLPGVAYYNKLIDRLQDAGIE
PMATLFHWDLPQALQDHGGWQNESVVDAFLDYAAFCFSTFGDRVKLWVTFHEPWVMSYAGYGTGQHPPGI
SDPGVASFKVAHLVLKAHARTWHHYNSHHRPQQQGHVGIVLNSDWAEPLSPERPEDLRASERFLHFMLGW
FAHPVFVDGDYPATLRTQIQQMNRQCSHPVAQLPEFTEAEKQLLKGSADFLGLSHYTSRLISNAPQNTCI
PSYDTIGGFSQHVNHVWPQTSSSWIRVVPWGIRRLLQFVSLEYTRGKVPIYLAGNGMPIGESENLFDDSL
RVDYFNQYINEVLKAIKEDSVDVRSYIARSLIDGFEGPSGYSQRFGLHHVNFSDSSKSRTPRKSAYFFTS
IIEKNGFLTKGAKRLLPPNTVNLPSKVRAFTFPSEVPSKAKVVWEKFSSQPKFERDLFYHGTFRDDFLWG
VSSSAYQIEGAWDADGKGPSIWDNFTHTPGSNVKDNATGDIACDSYHQLDADLNMLRALKVKAYRFSISW
SRIFPTGRNSSINSHGVDYYNRLINGLVASNIFPMVTLFHWDLPQALQDIGGWENPALIDLFDSYADFCF
QTFGDRVKFWMTFNEPMYLAWLGYGSGEFPPGVKDPGWAPYRIAHAVIKAHARVYHTYDEKYRQEQKGVI
SLSLSTHWAEPKSPGVPRDVEAADRMLQFSLGWFAHPIFRNGDYPDTMKWKVGNRSELQHLATSRLPSFT
EEEKRFIRATADVFCLNTYYSRIVQHKTPRLNPPSYEDDQEMAEEEDPSWPSTAMNRAAPWGTRRLLNWI
KEEYGDIPIYITENGVGLTNPNTEDTDRIFYHKTYINEALKAYRLDGIDLRGYVAWSLMDNFEWLNGYTV
KFGLYHVDFNNTNRPRTARASARYYTEVITNNGMPLAREDEFLYGRFPEGFIWSAASAAYQIEGAWRADG
KGLSIWDTFSHTPLRVENDAIGDVACDSYHKIAEDLVTLQNLGVSHYRFSISWSRILPDGTTRYINEAGL
NYYVRLIDTLLAASIQPQVTIYHWDLPQTLQDVGGWENETIVQRFKEYADVLFQRLGDKVKFWITLNEPF
VIAYQGYGYGTAAPGVSNRPGTAPYIVGHNLIKAHAEAWHLYNDVYRASQGGVISITISSDWAEPRDPSN
QEDVEAARRYVQFMGGWFAHPIFKNGDYNEVMKTRIRDRSLAAGLNKSRLPEFTESEKRRINGTYDFFGF
NHYTTVLAYNLNYATAISSFDADRGVASIADRSWPDSGSFWLKMTPFGFRRILNWLKEEYNDPPIYVTEN
GVSQREETDLNDTARIYYLRTYINEALKAVQDKVDLRGYTVWSAMDNFEWATGFSERFGLHFVNYSDPSL
PRIPKASAKFYASVVRCNGFPDPATGPHACLHQPDAGPTISPVRQEEVQFLGLMLGTTEAQTALYVLFSL
VLLGVCGLAFLSYKYCKRSKQGKTQRSQQELSPVSSF
EAX11622.1
BLASTn
BLASTp
Barra = Identidade
BUSCA EM BLAST
PROTENAS
FORMATO FASTA
>gi|47933334|gb|AAQ63935.1| cellulose synthase [Pinus radiata]
MEARTNTAAGSNKRNVRVSVRDDGELGPKPPQHINSHICQICGEDV
GLAADGEFFVACNECAFPVCRPCYEYEWKDGNQSCPQCKTRYKWH
KGSPQVDGDKEDECADDLDHDFNSTQGNRNEKQQIAEAMLHWQM
AYGRGEDVGPSRSESQELPQLQVPLITNGQAISGELPAGSSEYRRIA
APPTGGGSGKRVHPLPFPDSTQTGQVRA
>LINHA DO NOME
AY751548.1
L03637.1
AJ005984.1
NM_001246552.1
G24983.1
BK000460
NM_001045493.1
NM_001114949.1
BC037526.1
AB081072.1
AY136463.1
BC009121.1
AB052957.1