BLASTN 2.2.26+
Reference:
Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences", J Comput Biol 2000;
7(1-2):203-14.
Database: TAIR10_cdna_20110103_representative_gene_model_updated
33,602 sequences; 51,074,197 total letters
Query= Ahg924807
Length=706
Score E
Sequences producing significant alignments: (Bits) Value
AT1G62060.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu... 446 2e-124
AT1G62220.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu... 433 1e-120
AT1G62080.1 | Symbols: | unknown protein; BEST Arabidopsis tha... 425 2e-118
AT1G62000.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu... 412 2e-114
AT1G62214.1 | Symbols: | unknown pseudogene | chr1:22987483-22... 108 9e-23
> AT1G62060.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown;
LOCATED IN: endomembrane system; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G62220.1); Has
386 Blast hits to 125 proteins in 33 species: Archae - 6;
Bacteria - 295; Metazoa - 8; Fungi - 17; Plants - 46; Viruses
- 0; Other Eukaryotes - 14 (source: NCBI BLink). | chr1:22941569-22942300
REVERSE LENGTH=732
Length=732
Score = 446 bits (241), Expect = 2e-124
Identities = 388/455 (85%), Gaps = 26/455 (6%)
Strand=Plus/Plus
Query 10 aaaaa-aaTCAAAAGATGAATGCCACAAAGTTTGTTGTGCTTCTCGTGATTGGCGTTTTG 68
||||| |||||||||||||||||||||||||||||||||||||||||||||||| |||||
Sbjct 39 AAAAACAATCAAAAGATGAATGCCACAAAGTTTGTTGTGCTTCTCGTGATTGGCATTTTG 98
Query 69 TGTGCCATTGTCACCGCAAGGCAGGCCGAAGAAGTGTCCAAAGAGACCAAATTAGGCACC 128
||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Sbjct 99 TGTGCCATTGTCACCGCAAGGCAGGTCGAAGAAGTGTCCAAAGAGACCAAATTAGGCACC 158
Query 129 TCTCTTCCAAAAACTACTACCAAAGGCATTGGAGCTCAGCTTTCTGCTTATGG-CACAAC 187
|||||||||||| |||||| |||||||||||||||||||||||||||| ||| | | |
Sbjct 159 TCTCTTCCAAAATCTACTAACAAAGGCATTGGAGCTCAGCTTTCTGCTGCTGGTCTTA-C 217
Query 188 TTACAGCAACT-CT-TACGTCTCTAGCTATGCTAGAGCTTCTAATGGTCCCAAAGGTCCA 245
||||||| | | | |||||||||| |||||| |||| ||| |||||||||||||
Sbjct 218 TTACAGCGGCAGCAGT--GTCTCTAGCTCTGCTAGTGCTTTTAACAATCCCAAAGGTCCA 275
Query 246 GACGCC-GATGCAGCCGAATA-TGGCT-CAACATA-TACCAAT-GGACAAGTCTAT-GCC 299
| ||| | |||| ||||| | ||||| || || | ||||| | ||||||||| || |||
Sbjct 276 GGGGCCAG-TGCATCCGAA-AGTGGCTACA-CA-AGTACCA-TCGGACAAGTC-ATTGCC 329
Query 300 AAGGGTCGCAAGGCAA-ACATTTCTTCTAAAAGTGGTTCTAAAGCTACAGGAGAAGCTGA 358
|||||||||||||||| | |||||||| ||||| |||| |||||||| || ||||
Sbjct 330 AAGGGTCGCAAGGCAAGAG-TTTCTTCTGCAAGTGCTTCTGCCGCTACAGGTGAGGCTGC 388
Query 359 AGCTGCAGCGAATCGAAAAGCTGCTGCTGCACGTGCAAAAGGTTCGGTAAAATCCG-ACT 417
|||||||| || ||| ||||||||||||||||||||||| ||| |||| |||| | |
Sbjct 389 AGCTGCAGTGACTCGCAAAGCTGCTGCTGCACGTGCAAAGGGTAAGGTAGCTTCCGCA-T 447
Query 418 CAAGGGTGAAGGGCAGTTCCTCTGGGAAGAAGAAG 452
||||||||||||| |||||||| ||||||||||
Sbjct 448 CAAGGGTGAAGGG---TTCCTCTGAGAAGAAGAAG 479
> AT1G62220.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown;
LOCATED IN: endomembrane system; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G62000.1); Has
131 Blast hits to 122 proteins in 40 species: Archae - 0;
Bacteria - 73; Metazoa - 10; Fungi - 6; Plants - 37; Viruses
- 0; Other Eukaryotes - 5 (source: NCBI BLink). | chr1:22988196-22988844
FORWARD LENGTH=649
Length=649
Score = 433 bits (234), Expect = 1e-120
Identities = 385/456 (84%), Gaps = 18/456 (4%)
Strand=Plus/Plus
Query 10 aaaaa-aaTCAAAAGATGAATGCCACAAAGTTTGTTGTGCTTCTCGTGATTGGCGTTTTG 68
||||| ||||||||||||||||||||||||||||||||||||||||||||| ||||||||
Sbjct 42 AAAAACAATCAAAAGATGAATGCCACAAAGTTTGTTGTGCTTCTCGTGATTAGCGTTTTG 101
Query 69 TGTGCCATTGTCACCGCAAGGCAGGCCGAAGAAGTGTCCAAAGAGACCAAATTAGGCACC 128
||||||||||||||||||||||| | |||||||||||| |||||||||||||||||||||
Sbjct 102 TGTGCCATTGTCACCGCAAGGCATGTCGAAGAAGTGTCTAAAGAGACCAAATTAGGCACC 161
Query 129 TCTCTTCCAAAAACTACTACCAAAGGCATTGGAGCTCAGCTTTCTGCTTATGGCACAACT 188
||||||||||||| |||||||||||||||||||||||||||||||||| ||| | |||
Sbjct 162 TCTCTTCCAAAAAGTACTACCAAAGGCATTGGAGCTCAGCTTTCTGCTGCTGGTATTACT 221
Query 189 TACAGCAACTCTT-ACGTCTCTAGCTATGCTAGAGCTT-CTAATGGTCCCAAAGGTCCAG 246
| ||||| | | | |||| ||||| ||||| | || | || ||||||||||||||
Sbjct 222 TCCAGCAG-TAGTGATGTCTATAGCTCTGCTACTGGTTTC-AACAATCCCAAAGGTCCAG 279
Query 247 ACGCCGATGCAGCCGAATATGGCT-CAACATA-TACCAATGGACAAGTCTAT-GCCAAGG 303
|||| ||||| |||| |||||| || || | ||||| ||||||||| || |||||||
Sbjct 280 ACGCTAATGCATACGAAAATGGCTACA-CA-AGTACCAGCGGACAAGTC-ATTGCCAAGG 336
Query 304 GTCGCAAGGCAA-ACATTTCTTCTAAAAGTGGTTCTAAAGCTACAGGAGAAGCTGAAGCT 362
|||||||||||| | |||||||| ||||| ||||| |||| ||| || ||| |||||
Sbjct 337 GTCGCAAGGCAAGAG-TTTCTTCTGCAAGTGCTTCTACCGCTAAAGGTGATGCTAAAGCT 395
Query 363 GCAGCGAATCGAAAAGCTGCTGCTGCACGTGCAAAAGGTTCGGTAAAATCCG-ACTCAAG 421
|||| || ||| ||||||||||||||||| ||||| ||| |||| || | | |||||
Sbjct 396 GCAGTGACTCGCAAAGCTGCTGCTGCACGAGCAAATGGTAAGGTAGCTTCGGCA-TCAAG 454
Query 422 GGTGAAGGGCAGTTCCTCTGGGAAGAAGAAGGTCAA 457
||||||||| |||||||| ||||||||||| |||
Sbjct 455 GGTGAAGGG---TTCCTCTGAGAAGAAGAAGGGCAA 487
> AT1G62080.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G62000.1);
Has 118 Blast hits to 112 proteins in 34 species: Archae -
0; Bacteria - 43; Metazoa - 10; Fungi - 2; Plants - 36; Viruses
- 0; Other Eukaryotes - 27 (source: NCBI BLink). | chr1:22946242-22946949
REVERSE LENGTH=708
Length=708
Score = 425 bits (230), Expect = 2e-118
Identities = 381/451 (84%), Gaps = 21/451 (5%)
Strand=Plus/Plus
Query 11 aaaaaaTCAAAAGATGAATGCCACAAAGTTTGTTGTGCTTCTCGTGATTGGCGTTTTGTG 70
||||||||||||||||||||||||||||||| ||||||||||||||||||||||||||||
Sbjct 34 AAAAAATCAAAAGATGAATGCCACAAAGTTTCTTGTGCTTCTCGTGATTGGCGTTTTGTG 93
Query 71 TGCCATTGTCACCGCAAGGCAGGCCGAAGAAGTGTCCAAAGAGACCAAATTAGGCACCTC 130
||||||||||||||||||||||| | |||| |||||| ||||||||||||||| ||||
Sbjct 94 TGCCATTGTCACCGCAAGGCAGGTCAAAGATTTGTCCACAGAGACCAAATTAGGGGCCTC 153
Query 131 TCTTCCAAAAACTACTACCAAAGGCATTGGAGCTCAGCTTTCTGCTTA-TGGCACAACTT 189
|||||||||||| |||||||||||||||||||||||||||||||| | |||| ||||||
Sbjct 154 TCTTCCAAAAACGACTACCAAAGGCATTGGAGCTCAGCTTTCTGCA-ACTGGCTCAACTT 212
Query 190 ACAGCAACT-CTTACGTCTC-T-AGCTATGCTAGA-GCTTCTAATGGTCCCAAAGGTCCA 245
||||| | | | || || | |||||||||| | | || ||| ||||||||| |||
Sbjct 213 TCAGCAGCAGC--A-GTGTCGTTAGCTATGCTA-ATGGTTTTAACAATCCCAAAGGCCCA 268
Query 246 GACGCCGATGCAGCCGAATA-TGGCTCAACATATACCAATGGACAAGTCTA-TGCCAAGG 303
| ||| ||||| |||| | |||||| |||| ||||| ||||||||| | ||||||||
Sbjct 269 GGGGCCAATGCATTCGAA-AGTGGCTCCACATTTACCAGCGGACAAGTC-ACTGCCAAGG 326
Query 304 GTCGCAAGGCAA-ACATTTCTTCTAAAAGTGGTTCTAAAGCTACAGGAGAAGCTGAAGCT 362
|||||||||||| | |||||||| ||||| ||||| |||||||| || |||| ||||
Sbjct 327 GTCGCAAGGCAAGAG-TTTCTTCTGCAAGTGCTTCTACCGCTACAGGTGAGGCTGCAGCT 385
Query 363 GCAGCGAATCGAAAAGCTGCTGCTGCACGTGCAAAAGGTTCGGTAAAATCCG-ACTCAAG 421
|||| || ||| ||||||||||||||||||||||| ||| |||| |||| | |||||
Sbjct 386 GCAGTGACTCGCAAAGCTGCTGCTGCACGTGCAAAGGGTAAGGTAGCTTCCGCA-TCAAG 444
Query 422 GGTGAAGGGCAGTTCCTCTGGGAAGAAGAAG 452
||||||||| |||||||| ||||||||||
Sbjct 445 GGTGAAGGG---TTCCTCTGAGAAGAAGAAG 472
> AT1G62000.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown;
LOCATED IN: endomembrane system; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G62080.1); Has
163 Blast hits to 56 proteins in 10 species: Archae - 0; Bacteria
- 4; Metazoa - 109; Fungi - 6; Plants - 37; Viruses
- 0; Other Eukaryotes - 7 (source: NCBI BLink). | chr1:22913441-22914146
FORWARD LENGTH=706
Length=706
Score = 412 bits (223), Expect = 2e-114
Identities = 385/459 (84%), Gaps = 27/459 (6%)
Strand=Plus/Plus
Query 11 aaaaaaTCAAAAGATGAATGCCACAAAGTTTGTTGTGCTTCTCGTGATTGGCGTTTTGTG 70
|||||||||||||||||||||||||||||||||||||||||||||||||||| |||||||
Sbjct 38 AAAAAATCAAAAGATGAATGCCACAAAGTTTGTTGTGCTTCTCGTGATTGGCATTTTGTG 97
Query 71 TGCCATTGTCACCGCAAGGCAGGCCGAAGAAGTGTCCAAAGAGACCAAATTAGGCACCTC 130
||||||||||||||||||||||| | |||| |||||| |||||||||||||| ||||
Sbjct 98 TGCCATTGTCACCGCAAGGCAGGTCAAAGATTTGTCCACGGAGACCAAATTAGGGGCCTC 157
Query 131 TCTTCCAAAAACTACTACCAAAGGCATTGGAGCTCAGCTTTCTGCTTA-TGGCACAACTT 189
||||||||||||||||||||||||||||||||||||||||||||| | |||||||||||
Sbjct 158 TCTTCCAAAAACTACTACCAAAGGCATTGGAGCTCAGCTTTCTGCA-ACTGGCACAACTT 216
Query 190 ACAGCAACT-CTTACGTCTC-T-AGCTATGCTAGA-GCTTCTAATGGTCCCAAAGGTCCA 245
|||||| | | | || || | |||||||||| | | || ||| |||||||||||||
Sbjct 217 ACAGCACCAGC--A-GTGTCGTTAGCTATGCTA-ATGGTTTTAACAATCCCAAAGGTCCA 272
Query 246 GACGCCGATGCAGCCGAATA-TGGCTCAACATAT--ACCAATGGACAAGTCTA-TGCCAA 301
| ||| || || |||| | || | ||||| || |||| ||||||||| | ||||||
Sbjct 273 GGGGCCAATTCATTCGAA-AGTG-C-CAACACATTTACCAGCGGACAAGTC-ACTGCCAA 328
Query 302 GGGTCGCAAGGCAA-ACATTTCTTCTAAAAGTGGTTCTAAAGCT-ACAGGAGAAGCTGAA 359
|||||||||||||| | ||||||||| ||||| |||| ||| | ||| || |||| |
Sbjct 329 GGGTCGCAAGGCAAGAG-TTTCTTCTACAAGTGCTTCTGCCGCTGA-AGGCGATGCTGCA 386
Query 360 GCTGCAGCGAATCGAAAAGCTGCTGCTGCACGTGCAAAAGGTTCGGTAAAATCCG-ACTC 418
||||||| || ||| ||||||||||||||||||||||| ||| |||| |||| | ||
Sbjct 387 GCTGCAGTGACTCGCAAAGCTGCTGCTGCACGTGCAAACGGTAAGGTAGCTTCCGCA-TC 445
Query 419 AAGGGTGAAGGGCAGTTCCTCTGGGAAGAAGAAGGTCAA 457
|||||||||||| |||||||| ||||||||||| |||
Sbjct 446 AAGGGTGAAGGG---TTCCTCTGAGAAGAAGAAGGGCAA 481
> AT1G62214.1 | Symbols: | unknown pseudogene | chr1:22987483-22987938
REVERSE LENGTH=363
Length=363
Score = 108 bits (58), Expect = 9e-23
Identities = 70/76 (92%), Gaps = 0/76 (0%)
Strand=Plus/Plus
Query 79 TCACCGCAAGGCAGGCCGAAGAAGTGTCCAAAGAGACCAAATTAGGCACCTCTCTTCCAA 138
|||||| ||||||| |||||| |||||||| |||||||||||||| |||||||||||||
Sbjct 63 TCACCGTGAGGCAGGTCGAAGATGTGTCCAAGGAGACCAAATTAGGAACCTCTCTTCCAA 122
Query 139 AAACTACTACCAAAGG 154
||||||||||||||||
Sbjct 123 AAACTACTACCAAAGG 138
Lambda K H
1.33 0.621 1.12
Gapped
Lambda K H
1.28 0.460 0.850
Effective search space used: 34209454107
Database: TAIR10_cdna_20110103_representative_gene_model_updated
Posted date: Sep 25, 2014 6:13 PM
Number of letters in database: 51,074,197
Number of sequences in database: 33,602
Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5