BLASTN 2.2.26+
Reference:
Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences", J Comput Biol 2000;
7(1-2):203-14.
Database: TAIR10_cdna_20110103_representative_gene_model_updated
33,602 sequences; 51,074,197 total letters
Query= Ahg939225
Length=1980
                                                                     Score    E
Sequences producing significant alignments:                         (Bits)  Value
AT3G61740.1 | Symbols: SDG14, ATX3 | SET domain protein 14 | ch...     671  0.0
AT3G62500.1 | Symbols: | BEST Arabidopsis thaliana protein mat...      398  1e-109
AT5G28320.1 | Symbols: | unknown protein; BEST Arabidopsis tha...      187  3e-46
AT5G28400.1 | Symbols: | unknown protein; BEST Arabidopsis tha...      187  3e-46
AT5G28340.1 | Symbols: | Tetratricopeptide repeat (TPR)-like s...      158  3e-37
AT3G61723.1 | Symbols: | PHD finger protein | chr3:22846405-22...      147  5e-34
AT5G28380.1 | Symbols: | Tetratricopeptide repeat (TPR)-like s...      145  2e-33
> AT3G61740.1 | Symbols: SDG14, ATX3 | SET domain protein 14 |
chr3:22850837-22856972 REVERSE LENGTH=3777
Length=3777
Score = 671 bits (363), Expect = 0.0
Identities = 420/447 (94%), Gaps = 6/447 (1%)
Strand=Plus/Plus
Query 75 AATCTGAAGCGGTGCAAAATTGATTCAGAAATTGAGTATGGGAGCAAAAAGGGTGAGATT 134
|||||||||||||||||||||||||||||||||||||||||||| |||||||||||||||
Sbjct 464 AATCTGAAGCGGTGCAAAATTGATTCAGAAATTGAGTATGGGAGGAAAAAGGGTGAGATT 523
Query 135 ATGGTGTATAAGAAGAGACAAAGAGCAACCGTGGATCAACCATGTAGTAGAGAACCCGAA 194
|| |||||||||||||||||||||||||||||||||||||||||||| | |||||| |||
Sbjct 524 ATAGTGTATAAGAAGAGACAAAGAGCAACCGTGGATCAACCATGTAGCAAAGAACCAGAA 583
Query 195 GTTCATACAAGTAGCTCAAGCTCCTTGACCAGCAAAGAA---TCTCAACAAGTTTGCTCT 251
||| ||| |||||||||||||||||||| ||||||||| ||||||||||||||||||
Sbjct 584 CTTCTTACTAGTAGCTCAAGCTCCTTGACAAGCAAAGAAGAATCTCAACAAGTTTGCTCT 643
Query 252 GACCACTCCAAGTCTTCGCGTGGCAGAGTTCGAGCGGTTCCTTCAAGGTTCAAGGACTCC 311
||||| |||||||||||||||||||||||||||||||||||||| |||||||||||||||
Sbjct 644 GACCAGTCCAAGTCTTCGCGTGGCAGAGTTCGAGCGGTTCCTTCTAGGTTCAAGGACTCC 703
Query 312 ATTGTTGGTTCATGGAAATCTAGCCGTCGCAAGGAAGAGTCGACGGATTCTAGTCATGAT 371
||||||||| |||||||||||||||||||||||| |||||||||||| ||||||||||||
Sbjct 704 ATTGTTGGTACATGGAAATCTAGCCGTCGCAAGGGAGAGTCGACGGAGTCTAGTCATGAT 763
Query 372 GACGACGT-GAATCT--GGGGAAGAAGGTCAAAGGTTTCAGTGGAAGCTCGAAATTGCAT 428
||||||| | |||| |||||||||||||||||||||||||||||||||||||||||||
Sbjct 764 GACGACGACGTATCTCTGGGGAAGAAGGTCAAAGGTTTCAGTGGAAGCTCGAAATTGCAT 823
Query 429 CGAAGCAAAGACTCGAAGCTGTTTCCACATAAGGATAACGGAGACAGCAGCGAAGTAGAT 488
|| ||||||||||||||| ||||||||| ||||||||||||||||||||| |||||||||
Sbjct 824 CGCAGCAAAGACTCGAAGGTGTTTCCACGTAAGGATAACGGAGACAGCAGTGAAGTAGAT 883
Query 489 TGCGATTACTGGGATGTTAAAATTTCC 515
|||||||||||||||||| ||||||||
Sbjct 884 TGCGATTACTGGGATGTTCAAATTTCC 910
Score = 638 bits (345), Expect = 0.0
Identities = 426/463 (92%), Gaps = 13/463 (3%)
Strand=Plus/Plus
Query 1003 AGTTGCGGAAATCCAATCAGTATTGTGGCATTTGCAAGAGAATGTGGCACCCTTCAGATG 1062
||||||||||||||||||||||||||||||| ||||||||||| ||||||||||||||||
Sbjct 1494 AGTTGCGGAAATCCAATCAGTATTGTGGCATCTGCAAGAGAATATGGCACCCTTCAGATG 1553
Query 1063 ATGGAGATTGGGTTTGTTGTGATGGGTGTGATGTATGGGTACATGCTGGGTGCGACAACA 1122
||||||||||||||||||||||||||||||| |||||||||||||||| ||||||||||
Sbjct 1554 ATGGAGATTGGGTTTGTTGTGATGGGTGTGACGTATGGGTACATGCTGAATGCGACAACA 1613
Query 1123 TTTCAAAT-AAGCACTTTAAGGAACTGGAGCACAACAATTACTATTGCCCTAATTGTAAA 1181
|| | ||| || | ||||||||||||||||||||||||||||||||||||| ||||||||
Sbjct 1614 TTACGAATGAA-CGCTTTAAGGAACTGGAGCACAACAATTACTATTGCCCTGATTGTAAA 1672
Query 1182 GTCCAGCATGAGCTTGCGCCATCAATATTAGAAGAACAGAACTCAGTGTTCAAGTCTACA 1241
||||| || |||||| ||||| |||||||||||||||||||||||||||| |||||||||
Sbjct 1673 GTCCAACACGAGCTTACGCCAACAATATTAGAAGAACAGAACTCAGTGTTTAAGTCTACA 1732
Query 1242 AAAAAGGCGACAGAGACTGAGCTGCGTGATGAGGTTACTGTAGTCTGTAATGGCATGGAA 1301
||||| | |||||||||| ||||| ||||| | ||||||||||||||||||||||||||
Sbjct 1733 GAAAAGACAACAGAGACTGGGCTGCCTGATGCGATTACTGTAGTCTGTAATGGCATGGAA 1792
Query 1302 GGAACATATATCAGAAAATTTCATGCAATTGAGTGCAAGTGGGGTTCATGTGGGTCAAGG 1361
|| |||||||||||||||||||||||||||||||||||||| ||||||||||||||||||
Sbjct 1793 GGGACATATATCAGAAAATTTCATGCAATTGAGTGCAAGTGTGGTTCATGTGGGTCAAGG 1852
Query 1362 AAGCAGTCACCAAGTGAATGGGAAAGGCATACAGGCTGCAGAGCCAAAAAGTG----TA- 1416
|||||||||||||||||||||||| |||||||||||||||||||||||||||| ||
Sbjct 1853 AAGCAGTCACCAAGTGAATGGGAACGGCATACAGGCTGCAGAGCCAAAAAGTGGAAGTAT 1912
Query 1417 AG---A-G--TGAAAGACACAATGCTACCTCTAGAAAAATGGA 1453
|| | | |||||||||||||||||||||||||||||||||
Sbjct 1913 AGTGTAAGAGTGAAAGACACAATGCTACCTCTAGAAAAATGGA 1955
Score = 492 bits (266), Expect = 6e-138
Identities = 330/359 (92%), Gaps = 11/359 (3%)
Strand=Plus/Plus
Query 576 CCAGCATGGCCGGCTATGGTGGTCGATCCGATATCACAAGCGCCTGATGGGGTCTTGAAA 635
||||||||||||||| ||||| ||||||||||||||||||||||||||||||||||||||
Sbjct 1028 CCAGCATGGCCGGCTGTGGTGATCGATCCGATATCACAAGCGCCTGATGGGGTCTTGAAA 1087
Query 636 CATTGCGTCCCAGGCGCAATTTGTGTCATGTTTTTTGGGTACTCGAAGAATGGAACTCAG 695
||||||||||| |||||||||||||||||||||||||||||||| ||| |||||||||||
Sbjct 1088 CATTGCGTCCCTGGCGCAATTTGTGTCATGTTTTTTGGGTACTCCAAGGATGGAACTCAG 1147
Query 696 AGGGACTATGCATGGGTCAGACAAGGAATGGTGTATCCATTTACGGAGTTTATGGACAAA 755
||||||||||||||||| |||||||| |||||||||||||||||||| ||||||||||||
Sbjct 1148 AGGGACTATGCATGGGTAAGACAAGGGATGGTGTATCCATTTACGGAATTTATGGACAAA 1207
Query 756 TTTCAGGA-CAAGACAAACTTGTACAATTACAAGCCAAGTGAATTTAAGAAGGCACTTGA 814
|||||||| || ||||||||||| |||||||||| ||||||||||||| |||||||| ||
Sbjct 1208 TTTCAGGATCA-GACAAACTTGTTCAATTACAAGGCAAGTGAATTTAACAAGGCACTAGA 1266
Query 815 TGAAGCAGTTTTAGCAGAAAATGGGGTCGAGGGTAATTGTGGAGATGCTGAAATCAGCTG 874
||||||||||||||||||||||| | | | || ||||||||||||||||| ||
Sbjct 1267 GGAAGCAGTTTTAGCAGAAAATGG--TA-A---T--TT-TGGAGATGCTGAAATCATCTC 1317
Query 875 CCCAGATTCCTCTGCGACGGAATCCGACCAGGACTATGGACCTGCTTCTAGAATACAGG 933
|||||||||||||||||||||||||||||||||||||||||||||||||||| | ||||
Sbjct 1318 CCCAGATTCCTCTGCGACGGAATCCGACCAGGACTATGGACCTGCTTCTAGATTTCAGG 1376
Score = 326 bits (176), Expect = 7e-88
Identities = 197/207 (95%), Gaps = 2/207 (1%)
Strand=Plus/Plus
Query 1499 GCTAAGTGGACG--TGAAAGATGTGCTGTATGCAGATGGGTAGAAGACTGGGAAGAAAAT 1556
|||||||||||| |||||| |||||||||||||||||||||||||||||||||||||||
Sbjct 2048 GCTAAGTGGACGACTGAAAGGTGTGCTGTATGCAGATGGGTAGAAGACTGGGAAGAAAAT 2107
Query 1557 AAAATGATCATCTGTAACAGATGTCAAGTGGCTGTGCACCAAGAATGCTATGGGGTAAGC 1616
|||||||||||||||||||| |||||||||||||||||||||||||| ||||||||||||
Sbjct 2108 AAAATGATCATCTGTAACAGGTGTCAAGTGGCTGTGCACCAAGAATGTTATGGGGTAAGC 2167
Query 1617 AAATCTCAGGACCTCACCTCCTGGGTGTGCAGAGCATGTGAAACACCAGATATTGAGAGA 1676
||||||||||||||||| |||||||| ||||||||||||||||||||||| |||||||||
Sbjct 2168 AAATCTCAGGACCTCACTTCCTGGGTATGCAGAGCATGTGAAACACCAGACATTGAGAGA 2227
Query 1677 GACTGTTGTCTTTGTCCTGTTAAAGGT 1703
|| ||||||||||||||||| ||||||
Sbjct 2228 GATTGTTGTCTTTGTCCTGTAAAAGGT 2254
Score = 193 bits (104), Expect = 7e-48
Identities = 133/147 (90%), Gaps = 2/147 (1%)
Strand=Plus/Plus
Query 1809 TCACAAGCGCCT-ACTGCGGTCTTGAAACATTGCGTCCCGGGCGCGATATGTGTCATGTT 1867
|||||||||||| | || ||||||||||||||||||||| ||||| || |||||||||||
Sbjct 1061 TCACAAGCGCCTGA-TGGGGTCTTGAAACATTGCGTCCCTGGCGCAATTTGTGTCATGTT 1119
Query 1868 TTTTGGGTACTTGAAGAATGGAACTCAGAGCGACTATGCAAGGGTCAGACAAGGAATGAT 1927
||||||||||| ||| ||||||||||||| ||||||||| |||| |||||||| ||| |
Sbjct 1120 TTTTGGGTACTCCAAGGATGGAACTCAGAGGGACTATGCATGGGTAAGACAAGGGATGGT 1179
Query 1928 GTATCCATTTACGGAATTTATGGACAA 1954
|||||||||||||||||||||||||||
Sbjct 1180 GTATCCATTTACGGAATTTATGGACAA 1206
> AT3G62500.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: SET domain protein 14 (TAIR:AT3G61740.1); Has 66 Blast
hits to 66 proteins in 11 species: Archae - 0; Bacteria
- 0; Metazoa - 0; Fungi - 0; Plants - 66; Viruses - 0; Other
Eukaryotes - 0 (source: NCBI BLink). | chr3:23120423-23122222
FORWARD LENGTH=1125
Length=1125
Score = 398 bits (215), Expect = 1e-109
Identities = 267/291 (92%), Gaps = 8/291 (3%)
Strand=Plus/Plus
Query 75 AATCTGAAGCGGTGCAAAATTGATTCAGAAATTGAGTATGGGAGCAAAAAGGGTGAGATT 134
||||||||||||||||||||||||| ||||||||||||||| ||| ||||||||||||||
Sbjct 40 AATCTGAAGCGGTGCAAAATTGATTTAGAAATTGAGTATGGTAGCCAAAAGGGTGAGATT 99
Query 135 ATGGTGTATAAGAAGAGACAAAGAGCAACCGTGGATCAACCATGTAGTAGAGAACCCGAA 194
||||||||||||||||||||| |||||| ||| |||||||||||||| |||||||||| |
Sbjct 100 ATGGTGTATAAGAAGAGACAACGAGCAAGCGTTGATCAACCATGTAGCAGAGAACCCGGA 159
Query 195 GTTCAT-ACAAGTAGCTCAAGCTCCTTGACCAG--CA-AAGAATCTCAACAAGTTTGCTC 250
|||| | |||||||||||||||||||||||||| | |||||||||||||||| |||||
Sbjct 160 GTTC-TAACAAGTAGCTCAAGCTCCTTGACCAGTAGAGAAGAATCTCAACAAGTGTGCTC 218
Query 251 TGACCACTCCAAGTCTTCGCGTGGCAGAGTTCGAGCGGTTCCTTCAAGGTTCAAGGACTC 310
|||||| |||||||||||||||||||||||||||||||||||||||||||||| ||||||
Sbjct 219 TGACCAGTCCAAGTCTTCGCGTGGCAGAGTTCGAGCGGTTCCTTCAAGGTTCAGGGACTC 278
Query 311 CATTGTTGGTTCATGGAAATC-TAGCCG--TCGCAAGGAAGAGTCGACGGA 358
|||||||||| |||||||||| | | | |||||||||||||||||||||
Sbjct 279 CATTGTTGGTACATGGAAATCCTTGAAGAGTCGCAAGGAAGAGTCGACGGA 329
Score = 165 bits (89), Expect = 2e-39
Identities = 126/143 (88%), Gaps = 6/143 (4%)
Strand=Plus/Plus
Query 1003 AGTTGCGGAAATCCAATCAGTATTGTGGCATTTGCAAGAGAATGTGGCA-CCCTTCAGAT 1061
||||||||||||||||||||||||||||||||||||||||| | ||||| ||||||| ||
Sbjct 960 AGTTGCGGAAATCCAATCAGTATTGTGGCATTTGCAAGAGAGTATGGCAGCCCTTCAAAT 1019
Query 1062 GATGGAGATTGGGTTTGTTGTGATGGGTGTGATGTATGGGTACATGCTGGGTGCGACAAC 1121
|||||||||||||||| |||||||| ||||||||||||| | | ||||||||||||
Sbjct 1020 GATGGAGATTGGGTTTATTGTGATGAGTGTGATGTATGGC--CGTA-TGGGTGCGACAAT 1076
Query 1122 A-TTTCAAATAAGCACTTTAAGG 1143
| ||| | |||||| ||||||||
Sbjct 1077 AATTT-ATATAAGCGCTTTAAGG 1098
Score = 159 bits (86), Expect = 7e-38
Identities = 94/98 (96%), Gaps = 0/98 (0%)
Strand=Plus/Plus
Query 411 GGAAGCTCGAAATTGCATCGAAGCAAAGACTCGAAGCTGTTTCCACATAAGGATAACGGA 470
|||||||||||||||||||||||||||||| ||||||||||||||| ||||||||||| |
Sbjct 343 GGAAGCTCGAAATTGCATCGAAGCAAAGACCCGAAGCTGTTTCCACGTAAGGATAACGAA 402
Query 471 GACAGCAGCGAAGTAGATTGCGATTACTGGGATGTTAA 508
|||||||||||||||||||||||||||||||||| |||
Sbjct 403 GACAGCAGCGAAGTAGATTGCGATTACTGGGATGCTAA 440
> AT5G28320.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G28400.1);
Has 1861 Blast hits to 1522 proteins in 246 species: Archae
- 19; Bacteria - 134; Metazoa - 673; Fungi - 145; Plants -
123; Viruses - 8; Other Eukaryotes - 759 (source: NCBI BLink).
| chr5:10301936-10306142 FORWARD LENGTH=2784
Length=2784
Score = 187 bits (101), Expect = 3e-46
Identities = 126/137 (92%), Gaps = 6/137 (4%)
Strand=Plus/Plus
Query 755 ATTTCAGGA-CAAGACAAACTTGTACAATTACAAGCCAAGTGAATTTAAGAAGGCACTTG 813
||||||||| | ||||||||||||||||||||||||| | |||||||||| |||||||
Sbjct 2647 ATTTCAGGATC-AGACAAACTTGTACAATTACAAGCC-A---AATTTAAGAATGCACTTG 2701
Query 814 ATGAAGCAGTTTTAGCAGAAAATGGGGTCGAGGGTAATTGTGGAGATGCTGAAATCAGCT 873
| |||||||||||||||||||||||||||||| ||||||| ||||||||||||||||||
Sbjct 2702 AGGAAGCAGTTTTAGCAGAAAATGGGGTCGAGAATAATTGTAGAGATGCTGAAATCAGCT 2761
Query 874 GCCCAGATTCCTCTGCG 890
|||||||||||||||||
Sbjct 2762 GCCCAGATTCCTCTGCG 2778
> AT5G28400.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G28320.1);
Has 2580 Blast hits to 2028 proteins in 270 species: Archae
- 20; Bacteria - 158; Metazoa - 939; Fungi - 198; Plants -
144; Viruses - 14; Other Eukaryotes - 1107 (source: NCBI BLink).
| chr5:10344024-10348234 REVERSE LENGTH=2922
Length=2922
Score = 187 bits (101), Expect = 3e-46
Identities = 125/136 (92%), Gaps = 4/136 (3%)
Strand=Plus/Plus
Query 755 ATTTCAGGACAAGACAAACTTGTACAATTACAAGCCAAGTGAATTTAAGAAGGCACTTGA 814
||||||||| ||||||||||||||||||||||||| | |||||||||| ||||||||
Sbjct 2785 ATTTCAGGATTAGACAAACTTGTACAATTACAAGCC-A---AATTTAAGAATGCACTTGA 2840
Query 815 TGAAGCAGTTTTAGCAGAAAATGGGGTCGAGGGTAATTGTGGAGATGCTGAAATCAGCTG 874
|||||||||||||||||||||||||||||| ||||||| |||||||||||||||||||
Sbjct 2841 GGAAGCAGTTTTAGCAGAAAATGGGGTCGAGAATAATTGTAGAGATGCTGAAATCAGCTG 2900
Query 875 CCCAGATTCCTCTGCG 890
||||||||||||||||
Sbjct 2901 CCCAGATTCCTCTGCG 2916
> AT5G28340.1 | Symbols: | Tetratricopeptide repeat (TPR)-like
superfamily protein | chr5:10314118-10317160 FORWARD LENGTH=1308
Length=1308
Score = 158 bits (85), Expect = 3e-37
Identities = 126/145 (87%), Gaps = 5/145 (3%)
Strand=Plus/Plus
Query 1003 AGTTGCGGAAATCCAATCAGTATTGTGGCATTTGCAAGAGAATGTGGCACCCTTCAGATG 1062
||||||||||||||||||||||||||||||| ||||||||||| ||||||||| ||||||
Sbjct 131 AGTTGCGGAAATCCAATCAGTATTGTGGCATCTGCAAGAGAATATGGCACCCTCCAGATG 190
Query 1063 ATGGAGATTGGGTTTGTTGTGATGGGTGTGATGTATGGG-----TACATGCTGGGTGCGA 1117
|| |||||||||| ||||||||||||||| | ||||||| | ||||||| |||||
Sbjct 191 ATAGAGATTGGGTATGTTGTGATGGGTGTAACGTATGGGACGGGTTCATGCTGAATGCGA 250
Query 1118 CAACATTTCAAATAAGCACTTTAAG 1142
||||||| | ||| ||| |||||||
Sbjct 251 CAACATTACGAATGAGCGCTTTAAG 275
Score = 139 bits (75), Expect = 9e-32
Identities = 79/81 (98%), Gaps = 0/81 (0%)
Strand=Plus/Plus
Query 1338 AAGTGGGGTTCATGTGGGTCAAGGAAGCAGTCACCAAGTGAATGGGAAAGGCATACAGGC 1397
||||| ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 273 AAGTGTGGTTCATGTGGGTCAAGGAAGCAGTCACCAAGTGAATGGGAAAGGCATACAGGC 332
Query 1398 TGCAGAGCCAAAAAGTGTAAG 1418
||||||||||||||||| |||
Sbjct 333 TGCAGAGCCAAAAAGTGGAAG 353
> AT3G61723.1 | Symbols: | PHD finger protein | chr3:22846405-22846925
REVERSE LENGTH=424
Length=424
Score = 147 bits (79), Expect = 5e-34
Identities = 99/109 (91%), Gaps = 0/109 (0%)
Strand=Plus/Plus
Query 1077 TGTTGTGATGGGTGTGATGTATGGGTACATGCTGGGTGCGACAACATTTCAAATAAGCAC 1136
|||||||||||| || | |||||||| |||||||||||||||| |||| | ||| ||| |
Sbjct 152 TGTTGTGATGGGCGTAACGTATGGGTTCATGCTGGGTGCGACATCATTACGAATGAGCGC 211
Query 1137 TTTAAGGAACTGGAGCACAACAATTACTATTGCCCTAATTGTAAAGTCC 1185
|||||||||||||||||||||||||||||||||||| ||||||||||||
Sbjct 212 TTTAAGGAACTGGAGCACAACAATTACTATTGCCCTGATTGTAAAGTCC 260
> AT5G28380.1 | Symbols: | Tetratricopeptide repeat (TPR)-like
superfamily protein | chr5:10338723-10341007 REVERSE LENGTH=1173
Length=1173
Score = 145 bits (78), Expect = 2e-33
Identities = 82/84 (98%), Gaps = 0/84 (0%)
Strand=Plus/Plus
Query 1335 TGCAAGTGGGGTTCATGTGGGTCAAGGAAGCAGTCACCAAGTGAATGGGAAAGGCATACA 1394
|||||||| |||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 135 TGCAAGTGTGGTTCATGTGGGTCAAGGAAGCAGTCACCAAGTGAATGGGAAAGGCATACA 194
Query 1395 GGCTGCAGAGCCAAAAAGTGTAAG 1418
|||||||||||||||||||| |||
Sbjct 195 GGCTGCAGAGCCAAAAAGTGGAAG 218
Lambda K H
1.33 0.621 1.12
Gapped
Lambda K H
1.28 0.460 0.850
Effective search space used: 98091864930
Database: TAIR10_cdna_20110103_representative_gene_model_updated
Posted date: Sep 25, 2014 6:13 PM
Number of letters in database: 51,074,197
Number of sequences in database: 33,602
Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5