BLASTN 2.2.26+
Reference:
Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences", J Comput Biol 2000;
7(1-2):203-14.
Database: TAIR10_cdna_20110103_representative_gene_model_updated
33,602 sequences; 51,074,197 total letters
Query= Ahg917054
Length=744
Score E
Sequences producing significant alignments: (Bits) Value
AT5G54062.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu... 568 4e-161
AT5G53742.1 | Symbols: | Protein of unknown function (DUF1278)... 363 2e-99
AT3G61198.1 | Symbols: | other RNA | chr3:22655884-22656599 RE... 211 7e-54
AT5G54075.1 | Symbols: U3D | U3D; snoRNA | chr5:21945349-219457... 189 3e-47
AT5G53902.1 | Symbols: U3B | U3B; snoRNA | chr5:21887966-218886... 183 2e-45
AT1G10522.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu... 178 7e-44
AT5G53740.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu... 128 7e-29
AT5G53905.1 | Symbols: | unknown protein; CONTAINS InterPro DO... 97.1 2e-19
AT5G54070.1 | Symbols: AT-HSFA9, HSFA9 | heat shock transcripti... 58.4 1e-07
> AT5G54062.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown;
LOCATED IN: endomembrane system; CONTAINS InterPro DOMAIN/s:
Protein of unknown function DUF1278 (InterPro:IPR010701);
BEST Arabidopsis thaliana protein match is: Protein of unknown
function (DUF1278) (TAIR:AT5G53742.1); Has 30201 Blast
hits to 17322 proteins in 780 species: Archae - 12; Bacteria
- 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses
- 0; Other Eukaryotes - 2996 (source: NCBI BLink). | chr5:21938793-21939623
FORWARD LENGTH=831
Length=831
Score = 568 bits (307), Expect = 4e-161
Identities = 421/477 (88%), Gaps = 4/477 (1%)
Strand=Plus/Plus
Query 120 AGTTGATCTCACAAAATGTTGGTCATCACTCTTCAACGTTCAAGGTTGCAATATTGAAAT 179
|||||||||| |||||||||||||||| ||| ||||| |||| ||||||||||| |||||
Sbjct 190 AGTTGATCTCGCAAAATGTTGGTCATCGCTCCTCAACATTCACGGTTGCAATATCGAAAT 249
Query 180 CTTGAAATCTGCTTTAACGGGTAAGTTTGAAAACGTTGGATCCATC-TGCTGCAAGGCAT 238
||| ||||||| |||||| ||||||||||||||||||||||||| | ||||||||||| |
Sbjct 250 CTTTAAATCTGTTTTAACCGGTAAGTTTGAAAACGTTGGATCCA-CGTGCTGCAAGGCTT 308
Query 239 TTACGGAAGTGGATGCAAATTGTTGGCCAAAAATGTTTCCGCTAAATCCGTTATTCCCTT 298
|||| |||||||||||||| ||||||||||||||||||||| | ||||| |||||||||
Sbjct 309 TTACAGAAGTGGATGCAAAGTGTTGGCCAAAAATGTTTCCGTTGAATCCATTATTCCCTC 368
Query 299 CTCTTCTTAAGGATGGTTGTTCTCTCATCAGCGCAGCTGCACCCGCACACACGGCACCTC 358
||||||| ||||||||||| |||| ||||| | ||| |||||| ||||||| | |||| |
Sbjct 369 CTCTTCTCAAGGATGGTTGCTCTCGCATCATCTCAGGTGCACCAGCACACAAGACACCGC 428
Query 359 AATTCTCTGTCATCCCTGGTTCTTCGATCGATCTCACTAAATGTTTGTCATCACTTGTCA 418
||||| ||||||||||||||||| ||||||||||||| ||||||||||||||||||||||
Sbjct 429 AATTCCCTGTCATCCCTGGTTCTCCGATCGATCTCACAAAATGTTTGTCATCACTTGTCA 488
Query 419 ACGTTCAAGGTTGTGTAACAGAAATCCACAAATCAGTTTTCACAGGAAACTTTGGTAATG 478
||||||||||||||||||| |||||| |||||||||||||||||||||| | || ||| |
Sbjct 489 ACGTTCAAGGTTGTGTAACTGAAATCTACAAATCAGTTTTCACAGGAAAGTGTGATAACG 548
Query 479 TTGGAGCAATGTGCTGCAAAGCGTTT-TCGGCTGTGGATGCAAAATGTTGGCCACAAATG 537
| ||| ||||||||||| |||||| | ||||||||||||||||||||||||| |||||
Sbjct 549 TAGGATTTATGTGCTGCAAGGCGTTTAT-GGCTGTGGATGCAAAATGTTGGCCAAAAATG 607
Query 538 TTTCCGCTAAATCGGTTCTTTCCTTTTCTTCTCAAATCTAAATGTTCTCGCACCAAC 594
||||| ||||||| |||||| ||| ||| |||||| | ||||||||||| ||||
Sbjct 608 TTTCCACTAAATCCGTTCTTCCCTCCTCTCCTCAAAAATGTATGTTCTCGCATCAAC 664
> AT5G53742.1 | Symbols: | Protein of unknown function (DUF1278)
| chr5:21813671-21814018 FORWARD LENGTH=348
Length=348
Score = 363 bits (196), Expect = 2e-99
Identities = 289/333 (87%), Gaps = 10/333 (3%)
Strand=Plus/Plus
Query 271 ATGTTTCCGCTAAATCCGTTATTCCCTTCTCTTCTTAAGGATGGTTGTTCTCTCATCAGC 330
||||||||| | ||||| ||||||||| ||||||| ||||||||||| |||| ||||| |
Sbjct 1 ATGTTTCCGTTGAATCCATTATTCCCTCCTCTTCTCAAGGATGGTTGCTCTCGCATCATC 60
Query 331 GC-AGCTGCACCCGCACACACGGCACCTCAATT-CTCTGTCATCCCTGGTTCTTCGATCG 388
| || ||||| |||||||| ||||||||| || |||||||||||||||| ||||||
Sbjct 61 TCTGGC-GCACCAACACACACGAAACCTCAATTCCT-TGTCATCCCTGGTTCTCCGATCG 118
Query 389 ATCTCACTAAATGTTTGTCATCACTTGTCAACGTTCAAGGTTGTGTAACAGAAATCCACA 448
||||||| |||||||||||||||||||| |||||| ||||||| ||||| |||||| |||
Sbjct 119 ATCTCACAAAATGTTTGTCATCACTTGTTAACGTTGAAGGTTGCGTAACTGAAATCTACA 178
Query 449 AATCAGTTTTCACAGGAAACTTTGGTAATGTTGGAGCA-ATGTGCTGCAAAGCGTTTTCG 507
||||||||||||||||||| ||||||||||||||| | ||||||||||| ||||||||
Sbjct 179 AATCAGTTTTCACAGGAAAGTTTGGTAATGTTGGA-TATATGTGCTGCAAGGCGTTTTCA 237
Query 508 GCTGTGGATGCAAAATGTTGGCCACAAATGTTTCCGCTAAATCGGTTCTTTCCTTTTCTT 567
|||||||||| |||||||||||||||||||||||||||||||| |||||| ||| ||||
Sbjct 238 GCTGTGGATGTAAAATGTTGGCCACAAATGTTTCCGCTAAATCCGTTCTTCCCTCCTCTT 297
Query 568 CTCAA-ATCTAAATGTTCTCGC-ACCAACGCAG 598
||||| | || ||||||| | | ||||||||
Sbjct 298 CTCAAGAAGGAA-TGTTCTC-CTATCAACGCAG 328
> AT3G61198.1 | Symbols: | other RNA | chr3:22655884-22656599
REVERSE LENGTH=622
Length=622
Score = 211 bits (114), Expect = 7e-54
Identities = 128/135 (95%), Gaps = 0/135 (0%)
Strand=Plus/Minus
Query 610 ACCGCCGTGCAACCACCCCGGCAAGGTAGAACTGAGCCGAGTAACGATCCTCTACAGCAC 669
|||||||||||||||||||||||||||||||| |||||||||||||||||||||||||||
Sbjct 191 ACCGCCGTGCAACCACCCCGGCAAGGTAGAACCGAGCCGAGTAACGATCCTCTACAGCAC 132
Query 670 CAGATGCATCCGAAGACAATCGTGGCCGTTAATCACGCTCTGATCGCACGGTCATGGTTC 729
||||||| ||| ||||| ||||| ||||| ||||||||||||||||||||| ||||||||
Sbjct 131 CAGATGCGTCCTAAGACGATCGTAGCCGTCAATCACGCTCTGATCGCACGGCCATGGTTC 72
Query 730 ATCAACCAGGGTTAA 744
|||||||||||||||
Sbjct 71 ATCAACCAGGGTTAA 57
> AT5G54075.1 | Symbols: U3D | U3D; snoRNA | chr5:21945349-21945709
REVERSE LENGTH=361
Length=361
Score = 189 bits (102), Expect = 3e-47
Identities = 125/136 (92%), Gaps = 2/136 (1%)
Strand=Plus/Minus
Query 610 ACCGCCGTGCAACCACCCCGGCAAGGTAGAACTGAGCCGAGTAACGATCCTCTACAGCAC 669
|||||||||| ||||||||||||||||||||| |||||||||||||||||||||||||||
Sbjct 310 ACCGCCGTGCGACCACCCCGGCAAGGTAGAACCGAGCCGAGTAACGATCCTCTACAGCAC 251
Query 670 CAGATGCATCCGAAGACAATCGTGGCCGTTAATCACGCTCTGA-TCGCACGGTCATGGTT 728
| ||||| ||||| ||| ||||| ||||| ||||||||||| | |||||||||||||||
Sbjct 250 CGGATGCGTCCGAGGACGATCGTAGCCGTCAATCACGCTCT-AGCCGCACGGTCATGGTT 192
Query 729 CATCAACCAGGGTTAA 744
||||||||||||||||
Sbjct 191 CATCAACCAGGGTTAA 176
> AT5G53902.1 | Symbols: U3B | U3B; snoRNA | chr5:21887966-21888667
FORWARD LENGTH=702
Length=702
Score = 183 bits (99), Expect = 2e-45
Identities = 125/137 (91%), Gaps = 3/137 (2%)
Strand=Plus/Minus
Query 610 ACCGCCGTGCAACCACCCCGGCAAGGTAG-AACTGAGCCGAGTAACGATCCTCTACAGCA 668
|||||||||| |||||||||||||||||| ||| ||||||||||||||||||||||||||
Sbjct 535 ACCGCCGTGCGACCACCCCGGCAAGGTAGAAACCGAGCCGAGTAACGATCCTCTACAGCA 476
Query 669 CCAGATGCATCCGAAGACAATCGTGGCCGTTAATCACGCTCTGA-TCGCACGGTCATGGT 727
|| ||||| ||||| ||| ||||| ||||| ||||||||||| | ||||||||||||||
Sbjct 475 CCGGATGCGTCCGAGGACGATCGTAGCCGTCAATCACGCTCT-AGCCGCACGGTCATGGT 417
Query 728 TCATCAACCAGGGTTAA 744
|||||||||||||||||
Sbjct 416 TCATCAACCAGGGTTAA 400
> AT1G10522.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown;
LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures;
EXPRESSED DURING: 13 growth stages; Has 24 Blast hits to
24 proteins in 8 species: Archae - 0; Bacteria - 0; Metazoa
- 0; Fungi - 0; Plants - 24; Viruses - 0; Other Eukaryotes
- 0 (source: NCBI BLink). | chr1:3469636-3471488 FORWARD LENGTH=980
Length=980
Score = 178 bits (96), Expect = 7e-44
Identities = 121/133 (91%), Gaps = 1/133 (1%)
Strand=Plus/Minus
Query 610 ACCGCCGTGCAACCACCCCGGCAAGGTAGAACTGAGCCGAGTAACGATCCTCTACAGCAC 669
|||||||||| ||||||||||||||| || || ||||||||||||||||||||||||||
Sbjct 195 ACCGCCGTGCGACCACCCCGGCAAGGAAG-ACCAAGCCGAGTAACGATCCTCTACAGCAC 137
Query 670 CAGATGCATCCGAAGACAATCGTGGCCGTTAATCACGCTCTGATCGCACGGTCATGGTTC 729
| ||||| ||||||||| ||||| ||| | ||||||||||||| ||||||||||||||||
Sbjct 136 CGGATGCGTCCGAAGACGATCGTAGCCATCAATCACGCTCTGACCGCACGGTCATGGTTC 77
Query 730 ATCAACCAGGGTT 742
|||||||||||||
Sbjct 76 ATCAACCAGGGTT 64
> AT5G53740.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown;
LOCATED IN: chloroplast; EXPRESSED IN: synergid; BEST Arabidopsis
thaliana protein match is: unknown protein (TAIR:AT5G53905.1);
Has 1807 Blast hits to 1807 proteins in 277 species:
Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347;
Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source:
NCBI BLink). | chr5:21811633-21814908 FORWARD LENGTH=828
Length=828
Score = 128 bits (69), Expect = 7e-29
Identities = 106/124 (85%), Gaps = 2/124 (2%)
Strand=Plus/Plus
Query 72 AGCCGAGGTG-CTTGTGACACCTCGATTTCCTTCTATTTCTGCTTTTCCAGTTGATCTCA 130
|||||||| | || |||||||| | ||||| ||||||| ||| || |||| |||||||||
Sbjct 420 AGCCGAGG-GACTCGTGACACCGCAATTTCTTTCTATTCCTGGTTCTCCAATTGATCTCA 478
Query 131 CAAAATGTTGGTCATCACTCTTCAACGTTCAAGGTTGCAATATTGAAATCTTGAAATCTG 190
|||||||||||||||||||| ||||| |||||||||| || || || ||||| |||||||
Sbjct 479 CAAAATGTTGGTCATCACTCCTCAACATTCAAGGTTGTAAAATCGAGATCTTTAAATCTG 538
Query 191 CTTT 194
|||
Sbjct 539 TTTT 542
Score = 99.0 bits (53), Expect = 6e-20
Identities = 70/78 (90%), Gaps = 2/78 (3%)
Strand=Plus/Plus
Query 666 GCACCAGATGCATCCGAAGACAATCGTGGCCGTTAATCACGCTCTGA-TCGCACGGTCAT 724
||||| ||||| ||||| ||||||||| ||||| ||||||||||| | |||||||||||
Sbjct 31 GCACCGGATGCGTCCGAGGACAATCGTAGCCGTCAATCACGCTCT-AGCCGCACGGTCAT 89
Query 725 GGTTCATCAACCAGGGTT 742
||||||||||||||||||
Sbjct 90 GGTTCATCAACCAGGGTT 107
> AT5G53905.1 | Symbols: | unknown protein; CONTAINS InterPro
DOMAIN/s: Protein of unknown function DUF1278 (InterPro:IPR010701);
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT5G54062.1); Has 30201 Blast hits to 17322 proteins
in 780 species: Archae - 12; Bacteria - 1396; Metazoa
- 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes
- 2996 (source: NCBI BLink). | chr5:21888148-21889254
REVERSE LENGTH=507
Length=507
Score = 97.1 bits (52), Expect = 2e-19
Identities = 71/80 (89%), Gaps = 2/80 (3%)
Strand=Plus/Plus
Query 666 GCACCAGATGCATCCGAAGACAATCGTGGCCGTTAATCACGCTCTGA-TCGCACGGTCAT 724
||||| ||||| ||||| ||| ||||| ||||| ||||||||||| | |||||||||||
Sbjct 307 GCACCGGATGCGTCCGAGGACGATCGTAGCCGTCAATCACGCTCT-AGCCGCACGGTCAT 365
Query 725 GGTTCATCAACCAGGGTTAA 744
||||||||||||||||||||
Sbjct 366 GGTTCATCAACCAGGGTTAA 385
> AT5G54070.1 | Symbols: AT-HSFA9, HSFA9 | heat shock transcription
factor A9 | chr5:21943983-21945651 FORWARD LENGTH=1512
Length=1512
Score = 58.4 bits (31), Expect = 1e-07
Identities = 31/31 (100%), Gaps = 0/31 (0%)
Strand=Plus/Plus
Query 714 CGCACGGTCATGGTTCATCAACCAGGGTTAA 744
|||||||||||||||||||||||||||||||
Sbjct 1365 CGCACGGTCATGGTTCATCAACCAGGGTTAA 1395
Lambda K H
1.33 0.621 1.12
Gapped
Lambda K H
1.28 0.460 0.850
Effective search space used: 36118351693
Database: TAIR10_cdna_20110103_representative_gene_model_updated
Posted date: Sep 25, 2014 6:13 PM
Number of letters in database: 51,074,197
Number of sequences in database: 33,602
Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5