BLASTN 2.2.26+
Reference:
Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences", J Comput Biol 2000;
7(1-2):203-14.
Database: TAIR10_cdna_20110103_representative_gene_model_updated
33,602 sequences; 51,074,197 total letters
Query= Ahg470598
Length=1153
                                                                    Score   E
Sequences producing significant alignments:                         (Bits)  Value
AT1G05870.1 | Symbols: | Protein of unknown function (DUF1685)...   885     0.0
AT2G31560.1 | Symbols: | Protein of unknown function (DUF1685)...   346     3e-94
AT3G22690.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Protein o...   122     5e-27
> AT1G05870.1 | Symbols: | Protein of unknown function (DUF1685)
| chr1:1772211-1773293 REVERSE LENGTH=878
Length=878
Score = 885 bits (479), Expect = 0.0
Identities = 649/723 (90%), Gaps = 44/723 (6%)
Strand=Plus/Plus
Query 1 ATGGGTTGTGCTCGCTGCAAATCATCAGATCCATGGCAAACATCTGCCAGTGCCTTGGAA 60
|||||||||| |||||||||||||||||||||||||||||||||||||| ||||||
Sbjct 66 ATGGGTTGTGTTCGCTGCAAATCATCAGATCCATGGCAAACATCTGCCAATGCCTT---T 122
Query 61 GAAGACGTCGATGAATCTGGAATCAACGAAGCCTGGGTTGAGATCTCTAACCGCAGATCA 120
||| |||||||||||| ||||||||||||||||||||||||||||| | ||||||||||
Sbjct 123 GAATCCGTCGATGAATCCGGAATCAACGAAGCCTGGGTTGAGATCTCCAGCCGCAGATCA 182
Query 121 TTTGTCTCCGGCGAAGGTAGTAGTCGGAAGAAGCTGGAGAGGAAGAAGAGCCAAGTGTTA 180
|||||| |||||||| | |||||||||||||||| ||||||||||||||||||||||||
Sbjct 183 TTTGTCGCCGGCGAA-G--GTAGTCGGAAGAAGCTAGAGAGGAAGAAGAGCCAAGTGTTA 239
Query 181 CTGGAAGGTTACGTTGAGACTG------C-TGCT--GTGGATGATCAAAAAGACGATCTG 231
|||||||||||||||||||||| | | || |||||||||||||| |||||||||
Sbjct 240 CTGGAAGGTTACGTTGAGACTGCTTCTTCTTCCTCGGTGGATGATCAAAAGGACGATCTG 299
Query 232 ACGAGATCCAAGAGTTTGACGGATGACGATCTCGAAGATCTCAAAGGTTGTTTAGATCTA 291
||||||||||||||||||||||||||||| ||||||||||| | ||||||||||||||||
Sbjct 300 ACGAGATCCAAGAGTTTGACGGATGACGACCTCGAAGATCTTAGAGGTTGTTTAGATCTA 359
Query 292 GGGTTTGGTTTCAGCTACGACGAGATCCCTGAGCTCTGCAACACTTTACCTGCTTTGGAG 351
||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 360 GGGTTTGGTTTTAGCTACGACGAGATCCCTGAGCTCTGCAACACTTTACCTGCTTTGGAG 419
Query 352 CTTTGCTATTCAATGAGCCAGAAGTTCTTAGACGATAAGCA---TAAATCACCGGAAAGT 408
||||||||||||||||||||||||||||||||||||||||| |||||||||||||| |
Sbjct 420 CTTTGCTATTCAATGAGCCAGAAGTTCTTAGACGATAAGCAAAATAAATCACCGGAAACT 479
Query 409 TCGTCGGTGGAAGATTCTCCGTCGCCTCCACCTG-TCACCTCCACTCCCATTGCCAATTG 467
|||||||||||||||| |||||||||||||| || ||||| ||||||| |||||||||||
Sbjct 480 TCGTCGGTGGAAGATTGTCCGTCGCCTCCAC-TGGTCACCGCCACTCCGATTGCCAATTG 538
Query 468 GAAGATCTCTAGTCCCGGTGATAATCCGGATGATGTGAAAGCTAGGCTCAAATACTGGGC 527
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 539 GAAGATCTCTAGTCCCGGTGATAATCCGGATGATGTGAAAGCTAGGCTCAAATACTGGGC 598
Query 528 ACAAGCCGTTGCCTGTACTGTCCAATTGTGCAGCTGAATGAATCATTGTCACTTCTCAAC 587
|||||||||||||||||||||||||||||||||||||||||||||||||||| ||| |||
Sbjct 599 ACAAGCCGTTGCCTGTACTGTCCAATTGTGCAGCTGAATGAATCATTGTCACGTCTGAAC 658
Query 588 CA--A-TAACT--C-AA--A-CATTGT-T-A-GC-TA--T-G-GG-GGAG-A--TCTCAG 626
|| | || || | || | || | | | | || | | | || |||| | | ||||
Sbjct 659 CAGGAATA-CTAGCGAATGAGCAATATCTCAAGCATTGGTAGCGGAGGAGGAGATTTCAG 717
Query 627 AAAG-G-CATTGGAAGGTCTGAAATTTAACTGAATATCAACAATGATTCTCACTTGATTC 684
|||| | ||||||||||||| ||||| |||||||||||||||||| ||||||||| ||||
Sbjct 718 AAAGTGGCATTGGAAGGTCTAAAATTGAACTGAATATCAACAATGGTTCTCACTTAATTC 777
Query 685 CAT 687
|||
Sbjct 778 CAT 780
> AT2G31560.1 | Symbols: | Protein of unknown function (DUF1685)
| chr2:13436557-13438011 FORWARD LENGTH=1169
Length=1169
Score = 346 bits (187), Expect = 3e-94
Identities = 371/456 (81%), Gaps = 28/456 (6%)
Strand=Plus/Plus
Query 142 AGTCGGAAGAAGCTGGAGAGGAAGAAGAGCCAAGTGTTACTGGAAGGTTACGTTGAGACT 201
||| ||||||||||||||| |||||||||||||||||| || | | | | || ||
Sbjct 256 AGTAGGAAGAAGCTGGAGAAGAAGAAGAGCCAAGTGTTGCTTG-A-G----G--GATAC- 306
Query 202 GCTGCTGTGGATGATCAAAAAGACGATCTGACGAGATCCAAGAGTTTGACGGATGACGAT 261
|| | | ||||||||| |||| ||| |||||||| ||||||||||||||||||| |||
Sbjct 307 GC-G-T-TGGATGATC---AAGATGATTTGACGAGAGCCAAGAGTTTGACGGATGATGAT 360
Query 262 CTCGAAGATCTCAAAGGTTGTTTAGATCTAGGGTTTGGTTTCAGCTACGACGAGATCCCT 321
|| || || || ||||||||||||||||||||||||||||| || ||||| |||||||||
Sbjct 361 CTTGAGGAGCTTAAAGGTTGTTTAGATCTAGGGTTTGGTTTTAGTTACGATGAGATCCCT 420
Query 322 GAGCTCTGCAACACTTTACCTGCTTTGGAGCTTTGCTATTCAATGAGCCAGAAGTTCTTA 381
||||| ||||||||||| ||||| || |||||||| || || ||||||||||||||||||
Sbjct 421 GAGCTTTGCAACACTTTGCCTGCGTTAGAGCTTTGTTACTCCATGAGCCAGAAGTTCTTA 480
Query 382 GACGATAAGCAT-AAATC-ACCGGAAAGTTCGTCGGTGGAAGATTCTCCGTCGCCTCCAC 439
|| |||||||| ||| | ||| || || | || ||||||| | ||||||| ||||
Sbjct 481 GATGATAAGCAACAAAACCACCACAA-GTCCCA-GGAGGAAGATGATTCGTCGCCACCAC 538
Query 440 CTGTC-ACCTCCACTCCCATTGCCAATTGGAAGATCTCTAGTCCC-GGTGATAATCCGGA 497
| | | || || |||| ||||| ||||||||||||||||| ||| |||||| |||| ||
Sbjct 539 C-GACCACGACCGCTCCAATTGCAAATTGGAAGATCTCTAG-CCCTGGTGATGATCCAGA 596
Query 498 TGATGTGAAAGCTAGGCTCAAATACTGGGCACAAGCCGTTGCCTGTACTGTCCAATTGTG 557
|||||| |||||||| |||||||| ||||||||| | ||||| || || || | |||||
Sbjct 597 TGATGTAAAAGCTAGACTCAAATATTGGGCACAAACTGTTGCTTGCACCGTACGGTTGTG 656
Query 558 CAGCTGAA-T-GAATC-ATTGTCA-CTTCTCAACCA 589
|||||||| | ||||| ||| ||| ||| | |||||
Sbjct 657 CAGCTGAACTAGAATCGATTTTCATCTT-TGAACCA 691
> AT3G22690.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Protein
of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide
repeat (InterPro:IPR002885); BEST Arabidopsis thaliana
protein match is: Tetratricopeptide repeat (TPR)-like superfamily
protein (TAIR:AT2G29760.1); Has 49784 Blast hits to
14716 proteins in 280 species: Archae - 2; Bacteria - 10;
Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0; Other
Eukaryotes - 904 (source: NCBI BLink). | chr3:8021229-8024534
REVERSE LENGTH=2935
Length=2935
Score = 122 bits (66), Expect = 5e-27
Identities = 82/89 (92%), Gaps = 4/89 (4%)
Strand=Plus/Plus
Query 450 CACTCCCATTGCCAATTGGAAGATCTCTAGT-CC--CG-GTGATAATCCGGATGATGTGA 505
|||||| |||||||||||||||||||||||| || || |||||||||||||||||||||
Sbjct 2681 CACTCCAATTGCCAATTGGAAGATCTCTAGTCCCGTCGTGTGATAATCCGGATGATGTGA 2740
Query 506 AAGCTAGGCTCAAATACTGGGCACAAGCC 534
||||||| ||||||| |||||||||||||
Sbjct 2741 AAGCTAGACTCAAATGCTGGGCACAAGCC 2769
Lambda   K       H
1.33     0.621   1.12
Gapped
Lambda   K       H
1.28     0.460   0.850
Effective search space used: 56576014215
Database: TAIR10_cdna_20110103_representative_gene_model_updated
Posted date: Sep 25, 2014 6:13 PM
Number of letters in database: 51,074,197
Number of sequences in database: 33,602
Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5