BLASTN 2.2.26+
Reference:
Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences", J Comput Biol 2000;
7(1-2):203-14.
Database: TAIR10_cdna_20110103_representative_gene_model_updated
33,602 sequences; 51,074,197 total letters
Query= Ahg893687
Length=1920
Score E
Sequences producing significant alignments: (Bits) Value
AT3G01345.1 | Symbols: | Expressed protein | chr3:129138-13074... 1061 0.0
AT5G28919.1 | Symbols: | FUNCTIONS IN: molecular_function unkn... 337 3e-91
AT5G35760.1 | Symbols: | Beta-galactosidase related protein | ... 172 9e-42
AT2G06845.1 | Symbols: | Beta-galactosidase related protein | ... 150 4e-35
AT1G47770.1 | Symbols: | Beta-galactosidase related protein | ... 113 5e-24
> AT3G01345.1 | Symbols: | Expressed protein | chr3:129138-130749
FORWARD LENGTH=1612
Length=1612
Score = 1061 bits (574), Expect = 0.0
Identities = 1114/1362 (82%), Gaps = 87/1362 (6%)
Strand=Plus/Plus
Query 111 CGGCACCGGTAGAG-C-TTTCGCTCACCGGTGCCGGGAGCTTTC-TCCTCCTCGTCGATC 167
|||||||||||||| | ||| |||| ||||||||||||||| || | | ||| |||| ||
Sbjct 75 CGGCACCGGTAGAGCCGTTT-GCTCGCCGGTGCCGGGAGCTATCGTTC-CCTTGTCGCTC 132
Query 168 TAC-TCTCAAAGGTTTTATTTTCTGGT-TATCTGC-ATTCTCTTGT-TTCCTTC-TGTTT 222
| | |||||| || | | | |||||| | ||| | |||||| || |||| | |||||
Sbjct 133 T-CTTCTCAATGGCTATGTCATCTGGTGT-TCT-CTATTCTCCCGTGTTCC--CATGTTT 187
Query 223 CGGC-TCTCTTCCCTCCCCTTT-TATCCAAAAAGCTTTA---TGACC-GCATTACCTTC- 275
| || | |||| || |||||| ||||| |||| ||| ||||| || |||| ||
Sbjct 188 C-GCTTGTCTTTTCT-CCCTTTGTATCC--AAAG-ATTACCTTGACCACCA-TACC-TCT 240
Query 276 GAGC-C-TGCTCTTATTTGGTACTAGTTCTCCTCTTAACCTT-TTCGCGATTCT-CGTGG 331
|| | | | | ||| || || |||||||||| ||| | ||| || |||| || |||||
Sbjct 241 GAACACGTTCACTT-CTT-GTGCTAGTTCTCCCCTT-ATCTTCTTTGCGA-CCTCCGTGG 296
Query 332 ATGATCCCTCTTCTCGGCTCTCCTTGAAGTCAAGATCAGCATCTCCTTGTTCCTCGATGG 391
| |||||||| |||||||| |||||||||||| | | |||||||| | |||| | |||
Sbjct 297 TTTATCCCTCTCCTCGGCTCCCCTTGAAGTCAATACCGGCATCTCCCTATTCCCCTCTGG 356
Query 392 ATGGGAGCTCTGCCTCGACGCCGGAGCTGTGGAAGCATC-AAACCATGGGCGAAT-CAAC 449
|| |||||| ||| | |||||||||||||| |||| | | ||| ||| | || ||||
Sbjct 357 ATCGGAGCTTTGCTTTGACGCCGGAGCTGTTGAAG-AGCTTTACCTTGGAAG-ATCCAAC 414
Query 450 TTCTGCAGACCGAGCTTCAT--CATCCGCATCTTTTGATTGGGTAGATCTTTTCTCAGAA 507
||||||||||||||| |||| || | | | || | |||| |||||||| |||| ||
Sbjct 415 TTCTGCAGACCGAGCCTCATCGCAAAC-AAAC-TTCGTTTGGTTAGATCTTGTCTCTGAC 472
Query 508 GCCCCCAAATCGGTC-GCCGGACAGCCC-AAATCCGTTCTGCCGCCGCCG-C-T-TTTGG 562
|||| ||| |||| | || | | | ||| || |||||| ||||| ||||| | | ||||
Sbjct 473 GCCCTCAATTCGG-CAGCTGTA-AACCCTAAGTCCGTTTTGCCGTCGCCGTCGTCGTTGG 530
Query 563 TGAGGATGAGAAGAAGAACCT---GGCTAAACCTAATGGTGCCTC---G--TT----TTG 610
||| | ||||||| | | | |||||||||||||| |||||| | || |||
Sbjct 531 TGACAAAGAGAAGATGCGCTTGGCGGCTAAACCTAATGCTGCCTCAGGGTTTTCGCATTG 590
Query 611 GGCCTAGCCCAAGTCCCTTTTGTCCTCCTTCC-AGCACATTACCCAAACT-TTCGAAGCC 668
|||| | |||||| ||||| |||||||| || | | ||||||| ||||| || |||||
Sbjct 591 GGCCCAACCCAAGCCCCTTCTGTCCTCC-CCCTAACCCATTACCTAAACTATT-AAAGCC 648
Query 669 CACGAAGACTTCCGCTGGGTTAAGCTCATGTCATCTAAGGCCCAGGTCTGATTTT-AATT 727
|| |||| |||| |||||||||||| ||| || ||||||||||||||||| |||| || |
Sbjct 649 CATGAAGCCTTCTGCTGGGTTAAGCCCATATCCTCTAAGGCCCAGGTCTG-TTTTCAAAT 707
Query 728 ATGTTTTACAAAGCCCAAATTTTCTGAAGTTGTTATTGTTTAGGGAGAGTAATGAACGCT 787
|||||||||||||||||||||| ||| |||||||| |||||||||||||||| |||||
Sbjct 708 CTGTTTTACAAAGCCCAAATTTTTTGATGTTGTTATCATTTAGGGAGAGTAATGGACGCT 767
Query 788 TTCCTGTGATGGTGATGATGGATTTAGCTTGGATTAGTAGTTCTCTTTCCCAAAACCTGG 847
|||||||||||||||||||||||||||||||||||||||| ||||| || |||||||| |
Sbjct 768 TTCCTGTGATGGTGATGATGGATTTAGCTTGGATTAGTAGCTCTCTCTCACAAAACCTAG 827
Query 848 TCGATAGTTTATCTCGTTTGGTCCGTTTGCTTCA-TCGTTTGAGTCTAGTCTTGATTCGT 906
| |||||| ||||||||||||||||||| ||||| | |||||||||||||||||| ||||
Sbjct 828 TTGATAGTCTATCTCGTTTGGTCCGTTTTCTTCAGT-GTTTGAGTCTAGTCTTGAATCGT 886
Query 907 TTGGAGGTAATTTCTCTTTGGAAGTGTCCTAGAGTGTTGCTGCCCGTGCCAAGTATTATT 966
|||||| || ||||||||||||||||||||||||||||||||||| |||||||| ||
Sbjct 887 TTGGAGCTATTTTCTCTTTGGAAGTGTCCTAGAGTGTTGCTGCCCACACCAAGTATCGTT 946
Query 967 AATTTCCTATCAAGTTCCCTCCCTTTAGCACCCTCCAACTCTGTCTTGG-CTGGTAATGG 1025
||||| ||||| |||||||||| |||||||||||| | |||||| |||| ||||||||||
Sbjct 947 AATTTACTATCGAGTTCCCTCCTTTTAGCACCCTCTACCTCTGT-TTGGGCTGGTAATGG 1005
Query 1026 ACGCCACGCTAACAGAGTAATGGTTTGTTTGGGCTGGCT-TGATAGTTACCTTTGTCGAG 1084
||||| |||| |||| || |||||||||| || ||| || |||||||| ||||||||
Sbjct 1006 ACGCCTTGCTATCAGAATAGTGGTTTGTTTAGG-TGGGAATGGTAGTTACCATTGTCGAG 1064
Query 1085 ATCAGATGCTAAGTTTACTTTGGACTCTATCCAAGACTCTCTT-ACCCAACCATATGCAA 1143
|| ||||| ||||||| |||||||| |||||| |||||| ||| ||||| ||||| ||||
Sbjct 1065 ATAAGATGTTAAGTTTTCTTTGGACCCTATCCGAGACTC-CTTTACCCACCCATAGGCAA 1123
Query 1144 TTGGCCTTTAGAGTTAAGAAGACCGGCATTATGACCCTGTCTCTACGGAGTAGGTGTTAC 1203
|||||||||||||||||| || ||||||||||||||||||||||| |||||||||||||
Sbjct 1124 ATGGCCTTTAGAGTTAAGAGGATCGGCATTATGACCCTGTCTCTACCGAGTAGGTGTTAC 1183
Query 1204 CGAAGCTTTTTCAACTCTCTACCTACCCATTCACCAATTATCGAGTTCAGTCATGT---G 1260
|||||| || ||||||| || || ||| |||||| |||||| | ||| |||||| |
Sbjct 1184 CGAAGCCTTGAAAACTCTCAACTTATCCACTCACCATTTATCGCGCTCATTCATGTCATG 1243
Query 1261 TTAATCTATTGCTTGGATAATTTGCAATCTTCTGGT-AGGTTGGAAAAGTACGGCATTAT 1319
|||||||||||||||||||||||||||||| || | | |||| ||||||||||||||||
Sbjct 1244 TTAATCTATTGCTTGGATAATTTGCAATCT-CTTGACATGTTGAAAAAGTACGGCATTAT 1302
Query 1320 GACCATGTCTTCAAGAGGTGGTTACCGTCTCTTTT-TCAACCTCGTTAACCCGTCAGCCT 1378
|||||||||||||||| ||||||||||||| |||| ||||||||||||||||||||||||
Sbjct 1303 GACCATGTCTTCAAGAAGTGGTTACCGTCT-TTTTCTCAACCTCGTTAACCCGTCAGCCT 1361
Query 1379 CCTCTGCGGAGCACCT-TTCCAAAAGCTCGGATGCATTGTTT 1419
|||| |||||||| | ||| |||||||||||||||||||||
Sbjct 1362 CCTCAACGGAGCACTTCTTC-AAAAGCTCGGATGCATTGTTT 1402
> AT5G28919.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana protein
match is: Expressed protein (TAIR:AT3G01345.1); Has 30201
Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria
- 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).
| chr5:10940033-10941771 FORWARD LENGTH=1739
Length=1739
Score = 337 bits (182), Expect = 3e-91
Identities = 281/328 (86%), Gaps = 10/328 (3%)
Strand=Plus/Plus
Query 1097 GTTTACTTTGGACTCTATCCAAGACT-CTCTTACCCAACCATATG-CAATTGGC-CTTTA 1153
|||| |||||||| || ||| ||||| | ||||||||||||| | | ||| || ||||
Sbjct 1225 GTTTCCTTTGGACCCTTTCCGAGACTAGT-TTACCCAACCATA-GTC-ATTTGCTTTTTA 1281
Query 1154 GAGTTAAGAAG-ACCGGCATTATGACCCTGTCTCTACGGAGTAGGTGTTACCGAAGCTTT 1212
|||||||| || ||||||||||||||||| | ||||||||||||||||||||||||| |
Sbjct 1282 GAGTTAAG-AGTACCGGCATTATGACCCTATTTCTACGGAGTAGGTGTTACCGAAGCCTC 1340
Query 1213 TTCAACTCTCTACCTACCCATTCACCAATTATCGAGTTCAGTCATGTGTTAATCTATTGC 1272
| |||||||||| |||||||| ||||||||||| || |||||||||| ||||||| ||||
Sbjct 1341 TACAACTCTCTATCTACCCATACACCAATTATCAAGGTCAGTCATGTATTAATCTTTTGC 1400
Query 1273 TTGGATAATTTGCAATCTTCTGGTAGGTTGGAAAAGTACGGCATTATGACCATGTCTTCA 1332
||||||||||||||| |||||| | | |||||||||||||||||||||||||||||||
Sbjct 1401 TTGGATAATTTGCAACCTTCTGTTTGTTTGGAAAAGTACGGCATTATGACCATGTCTTTT 1460
Query 1333 AGAGGTGGTTACCGTCTCTTTTTCAACCTCGTTAACCCGTCAGCCTCCTCTGCGGAGCAC 1392
|||| |||||||||| || ||||||||||| ||||| || ||||||| | |||||||
Sbjct 1461 AGAGTTGGTTACCGTTTCCTTTTCAACCTCACCAACCCAACATCCTCCTCCGTGGAGCAC 1520
Query 1393 CTT-TCCAAAAGCTCGGATGCATTGTTT 1419
| || |||||||||||||||||||||
Sbjct 1521 ATGATC-AAAAGCTCGGATGCATTGTTT 1547
> AT5G35760.1 | Symbols: | Beta-galactosidase related protein
| chr5:13934108-13934782 FORWARD LENGTH=534
Length=534
Score = 172 bits (93), Expect = 9e-42
Identities = 163/196 (83%), Gaps = 7/196 (4%)
Strand=Plus/Plus
Query 1673 CCGTGGAGCA-TTTTCTCAAAAGCTCGTTTGCGTTATTCGCGCGAGCCGTCTATGACCAT 1731
|||||||||| ||| ||||| |||| | ||| || || || |||||| ||||||||
Sbjct 197 CCGTGGAGCACCTTT-TCAAAGGCTCATATGCATTGTTTGCAATAGCCGTTTATGACCA- 254
Query 1732 C-CGTTGGTCGAGGATTTCGCGAAGCCCGTCTTCATGGTCGAATCCGCTATGGCCTCAAA 1790
| |||||||||||||||||||||||||||||||||||||||||||||| | |||||||
Sbjct 255 CACGTTGGTCGAGGATTTCGCGAAGCCCGTCTTCATGGTCGAATCCGCCAAAGCCTCAAC 314
Query 1791 AGATTCTTCAAACTTTGCAAACCTTTTTAAGA-TGAGCATCAT-TTTGGAAAACTCATGG 1848
||||| ||||||| ||| |||| ||||||| | ||| || | | |||| ||||||||||
Sbjct 315 AGATTTTTCAAACCTTGTAAACTTTTTTAAAAGTGAACACC-TCTTTGCAAAACTCATGA 373
Query 1849 AGCCTTTATCTTTATT 1864
||||| |||||| |||
Sbjct 374 AGCCTCTATCTTCATT 389
Score = 124 bits (67), Expect = 2e-27
Identities = 77/82 (94%), Gaps = 0/82 (0%)
Strand=Plus/Plus
Query 1433 CCAAAAGGGTGAAGAAGAACGGCATTATGATCCCGTCTCTACGGAGCGGTGGTTACCGAA 1492
||||||| ||||||||||||| |||||||||||| |||| ||||||||||||||||||||
Sbjct 5 CCAAAAGAGTGAAGAAGAACGCCATTATGATCCCATCTCCACGGAGCGGTGGTTACCGAA 64
Query 1493 GCTTCTTCAACTCTCTATCTCC 1514
||||||||||||||||| ||||
Sbjct 65 GCTTCTTCAACTCTCTACCTCC 86
> AT2G06845.1 | Symbols: | Beta-galactosidase related protein
| chr2:2754666-2756008 FORWARD LENGTH=948
Length=948
Score = 150 bits (81), Expect = 4e-35
Identities = 91/96 (95%), Gaps = 0/96 (0%)
Strand=Plus/Plus
Query 1324 ATGTCTTCAAGAGGTGGTTACCGTCTCTTTTTCAACCTCGTTAACCCGTCAGCCTCCTCT 1383
|||||||||||||||||||||||||||||| ||||||||||||||||||||||||||||
Sbjct 1 ATGTCTTCAAGAGGTGGTTACCGTCTCTTTCTCAACCTCGTTAACCCGTCAGCCTCCTCC 60
Query 1384 GCGGAGCACCTTTCCAAAAGCTCGGATGCATTGTTT 1419
| ||||||||||| ||||||||| ||||||||||||
Sbjct 61 GTGGAGCACCTTTTCAAAAGCTCAGATGCATTGTTT 96
> AT1G47770.1 | Symbols: | Beta-galactosidase related protein
| chr1:17590266-17591261 FORWARD LENGTH=618
Length=618
Score = 113 bits (61), Expect = 5e-24
Identities = 91/105 (87%), Gaps = 4/105 (4%)
Strand=Plus/Plus
Query 1716 AGCCGTCTATGACCATC-CGTTGGTCGAGGATTTCGCGAAGCCCGTCTTCATGGTCGAAT 1774
||||||||||||||| | ||||||||||||||||||||||||||||| |||||||||||
Sbjct 513 AGCCGTCTATGACCA-CACGTTGGTCGAGGATTTCGCGAAGCCCGTCAACATGGTCGAAT 571
Query 1775 CCG-CTATGGCCTCAAAAGATTCTTCAAACTTTGCAAACCTTTTT 1818
||| | | ||| |||| ||||| ||||||| ||| ||| |||||
Sbjct 572 CCGTCAA-GGCTTCAACAGATTTTTCAAACCTTGTGAACTTTTTT 615
Lambda K H
1.33 0.621 1.12
Gapped
Lambda K H
1.28 0.460 0.850
Effective search space used: 95079832230
Database: TAIR10_cdna_20110103_representative_gene_model_updated
Posted date: Sep 25, 2014 6:13 PM
Number of letters in database: 51,074,197
Number of sequences in database: 33,602
Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5