BLASTN 2.2.26+
Reference:
Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences", J Comput Biol 2000;
7(1-2):203-14.
Database: TAIR10_cdna_20110103_representative_gene_model_updated
33,602 sequences; 51,074,197 total letters
Query= Ahg939784
Length=557
                                                                     Score      E
Sequences producing significant alignments:                         (Bits)  Value
AT5G03930.1 | Symbols: | unknown protein; BEST Arabidopsis tha...      361  5e-99
AT5G03920.1 | Symbols: | unknown protein; BEST Arabidopsis tha...      268  3e-71
> AT5G03930.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G03920.1);
Has 16 Blast hits to 16 proteins in 2 species: Archae - 0;
Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 16; Viruses
- 0; Other Eukaryotes - 0 (source: NCBI BLink). | chr5:1059504-1060154
FORWARD LENGTH=651
Length=651
Score = 361 bits (195), Expect = 5e-99
Identities = 341/408 (84%), Gaps = 23/408 (6%)
Strand=Plus/Plus
Query 137 CACTTAAGAAACT-TGGTGCGGAGCTAGAGTTTGGCTCCAATAATCGGTTCCATATACAA 195
||||||||||| | ||| ||||||||||||| |||||||||| |||| ||||||||||||
Sbjct 149 CACTTAAGAAA-TGTGGCGCGGAGCTAGAGTGTGGCTCCAATGATCGATTCCATATACAA 207
Query 196 GTCGAAAACTTGAGCCATCTCGGAATGTCAAAGTCCAGTATGACTGGTGTAAATATCAAG 255
|||||||||||||||||| | ||||| || ||||| || ||| |||||||||| ||| ||
Sbjct 208 GTCGAAAACTTGAGCCATATGGGAATCTCCAAGTCTAGAATGGCTGGTGTAAAGATCGAG 267
Query 256 TGCGAAGTCATTTATAAGGATGATTGGGAAGAAGA--CGACAACGTCATAATCGTTGATG 313
||||| |||||||||||||| || | |||||| || || | || | ||||||| |||
Sbjct 268 TGCGAGGTCATTTATAAGGAAGACTCGGAAGACGAGGCGGTA--GTTAGAATCGTTAATG 325
Query 314 CTCGAA-GGCCATTAGACACAAACTTTGGATGGCTGAACAACGATCTTGTTACTCCAAAA 372
|| ||| ||||||||| ||||||||||||||||||||||||||||||| || ||||| |
Sbjct 326 CT-GAAAGGCCATTAGGCACAAACTTTGGATGGCTGAACAACGATCTTCTTGCTCCACTA 384
Query 373 GGTTATATCTTAAACCGCGGCCTAGTC-GCCGATATCACAG-TGAGTTTCAAGAGCGGGT 430
| | || | | | | | || | ||||| ||| ||| |||||||||| |||| ||
Sbjct 385 GCTCAT--CGT-----G-GTC-TAA-CAGCCGAAATC-CAGCTGAGTTTCAACAGCGAGT 433
Query 431 TATTTGACAAGATCGACCTACAGATATTCCATCAAGGCGAGGGGAGATATTTTTCCCTTG 490
|||| |||||||||||||| ||||||||||| | ||||||||||||||||||||||| |
Sbjct 434 TATTGGACAAGATCGACCTCCAGATATTCCACCGAGGCGAGGGGAGATATTTTTCCCAAG 493
Query 491 ACGATACTCTTACTTATCTTCAATGTTTTTTCCTC-TCTAAAATCTAA 537
||||||||||| |||||||||||||||||||| | ||||||||||||
Sbjct 494 ACGATACTCTTGGTTATCTTCAATGTTTTTTCC-CATCTAAAATCTAA 540
> AT5G03920.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G03930.1);
Has 30201 Blast hits to 17322 proteins in 780 species: Archae
- 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants
- 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr5:1058146-1058769 FORWARD LENGTH=624
Length=624
Score = 268 bits (145), Expect = 3e-71
Identities = 433/566 (77%), Gaps = 43/566 (8%)
Strand=Plus/Plus
Query 1 TAGATTGAGAAGTCCCC---CATCATGGCCAAAAAAGGAGAACATTCCGGAGCTAAGGAT 57
||||| ||||||||||| |||||||| || |||||||||| || |||||| | |
Sbjct 9 TAGATGGAGAAGTCCCCGGTGGTCATGGCCGAAGAAGGAGAACACTCGGGAGCT-A--A- 64
Query 58 GTGAAACTTGCCAAACCCAAAATCTCCATGTCGGATCTCACCTTTGTCATTTA--TGTA- 114
| | ||||| |||||||||||||||||||||||||||||||||||||||| ||||
Sbjct 65 G-G-AACTTCCCAAACCCAAAATCTCCATGTCGGATCTCACCTTTGTCATAGGCGTGTAC 122
Query 115 ACTACCGAAGAAAAGATGTTAGCACTTA-AGAAACTTGGTGCGGAGCTAGAGTTTGGCTC 173
|| ||| || ||||| |||| || || |||| |||| ||||| | |||||| | |
Sbjct 123 ACAACCAAATCAAAGAAGTTAACAGTTGTAGAAT-TTGGCGCGGACCGCGAGTTTAAC-C 180
Query 174 C-AATAATCGGTTCCATATACAAGTCGAAAACTTGAGCCATCTCGGAA-TGTCAAAGTCC 231
| ||| | |||||| | ||| || ||| | ||||||| | | | | || || | |||||
Sbjct 181 CTAATGACCGGTTCAAAATAGAAATCGTAGACTTGAGTCGTTTTG-AACTGCCCAAGTCT 239
Query 232 AGTATG-ACTGGTGTAAAT-ATCAAGTGCGAAGTCATTTATAAGGATGATTGGGAAGAAG 289
||||| | || ||||| | |||| || |||||| |||||||| | |||||| || ||
Sbjct 240 GGTATGGA-TGATGTAA-TGATCACTTGGGAAGTCGTTTATAAGAAAGATTGGCAA-AA- 295
Query 290 ACGACAACGTCATAATCGTTGATGCTCGAAGGCCATTAGACACAAACTTTGGATGGCTGA 349
| | |||| ||||||| | | ||||||| |||||||||||||||| ||||| ||
Sbjct 296 -C-ATT--GTCACAATCGTTAAAGGTCGAAGGTCATTAGACACAAACTTACGATGGTTGT 351
Query 350 ACAACGATCTTGTTACTCCAAAAGGT-TATATCTTAAACCGCGGCCTAGTCGCCGATATC 408
|||||||||| || ||||| ||| | || ||| | || | |||||| || |||| ||
Sbjct 352 TCAACGATCTTCTTGCTCCAGAAGATCTA-ATC--AGAC-GTGGCCTAACCGGCGATGTC 407
Query 409 ACAG-TGAGTTTCAA-GAGCGGGTTATTTGACAAGATC-GACCTACAGATATTCCATC-A 464
| || ||||||| || | ||| ||| ||||||||||| | ||| |||||||| || |
Sbjct 408 A-AGGTGAGTTTAAACG-GCGACTTACTTGACAAGATCAG-CCTTGAGATATTC-ATTGA 463
Query 465 AGGCGAGGGGAGATATTTTTCCCTTGACGATACTCTTACTTATCTTCAATGTTTTTTCCT 524
| |||| |||||||||||||| ||| |||||||||| |||||||||||||||||||
Sbjct 464 AAGCGACCAGAGATATTTTTCCCCTGAAGATACTCTTAGATATCTTCAATGTTTTTTCCC 523
Query 525 CTCTAAAATCTAACTTAT-ATATATA 549
|||||||| |||| |||| || ||||
Sbjct 524 CTCTAAAAACTAA-TTATTATGTATA 548
Lambda      K        H
   1.33    0.621     1.12
Gapped
Lambda      K        H
   1.28    0.460    0.850
Effective search space used: 26724566204
Database: TAIR10_cdna_20110103_representative_gene_model_updated
Posted date: Sep 25, 2014 6:13 PM
Number of letters in database: 51,074,197
Number of sequences in database: 33,602
Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5