BLASTN 2.2.26+
Reference:
Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000),
"A greedy algorithm for aligning DNA sequences", J Comput Biol 2000;
7(1-2):203-14.
Database: TAIR10_cdna_20110103_representative_gene_model_updated
33,602 sequences; 51,074,197 total letters
Query= Ahg475001
Length=1638
Score E
Sequences producing significant alignments: (Bits) Value
AT1G63540.1 | Symbols: | hydroxyproline-rich glycoprotein fami... 1223 0.0
AT1G63530.1 | Symbols: | BEST Arabidopsis thaliana protein mat... 274 2e-72
> AT1G63540.1 | Symbols: | hydroxyproline-rich glycoprotein family
protein | chr1:23566958-23569495 FORWARD LENGTH=1997
Length=1997
Score = 1223 bits (662), Expect = 0.0
Identities = 983/1125 (87%), Gaps = 73/1125 (6%)
Strand=Plus/Plus
Query 552 TGGTTCACCGTTTGGGAACAA---TGCTTTTGCGATACCTGATGTTGGTAGCTCGCCAGT 608
||||||||||||||||||||| ||||||||| | |||||||||||||| |||||||||
Sbjct 908 TGGTTCACCGTTTGGGAACAATGTTGCTTTTGCCAGACCTGATGTTGGTATCTCGCCAGT 967
Query 609 AGCTTCCTCCTCAACTACTACAGAAGTTTTTGGTGCAACTCCAACGA-TTT-TTCCTTCA 666
|||||||||||| |||| ||| ||| ||||||||||||||||| ||| ||| || ||||
Sbjct 968 AGCTTCCTCCTCCACTAGTACTGAAATTTTTGGTGCAACTCCAGCGAGTTTGTT--TTCA 1025
Query 667 CCTTTTGGGCCAAAGCAAGCTCCTGTCCAAGCTAGTGCATCTAGCACTTTCACA-TCCCC 725
||||||||||||| ||||||||||||||||||||||||||||||||||| |||| |||||
Sbjct 1026 CCTTTTGGGCCAATGCAAGCTCCTGTCCAAGCTAGTGCATCTAGCACTTCCACATTCCCC 1085
Query 726 CCTATTTGGTTGCACCCCAGCATCTCCGACGACTGGTACTTCGCTGTTCAACTCTG--TT 783
|| |||||||||| |||| |||||||| ||| |||| |||||||||||||||||| ||
Sbjct 1086 CC-ATTTGGTTGCGTCCCACCATCTCCGTCGAGTGGTTCTTCGCTGTTCAACTCTGCTTT 1144
Query 784 T--TT-CGT-TC--CA-------C-GCCAGC---AT----C-ATCCTCTTCCGACCTTTT 821
| || | | | || | ||| | | | |||||||||| ||||| |
Sbjct 1145 TGGTTCCCTGCCAGCACCCTCTTCTTCCAACTTTTTCGGACAATCCTCTTCCAACCTTCT 1204
Query 822 CGGACAAAACCCATCGACTACTGGTGTTGGTT-CCTTGCCTGGATCTCCATTGAACAGTT 880
|||||||||||||||||||||||||||||||| || ||||||||||||||||||||||||
Sbjct 1205 CGGACAAAACCCATCGACTACTGGTGTTGGTTACC-TGCCTGGATCTCCATTGAACAGTT 1263
Query 881 GTATTCCTGGATTTGGTGTTGGTTACCTGCCTGGATCTTCCTCCAACCTATTTAGATCAA 940
| |||| ||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1264 CTTTTCCCGGATTTGGTGTTGGTTACCTGCCTGGATCTTCCTCCAACCTATTTAGATCAA 1323
Query 941 ACCCACCAAATTTTGGTGGTGGTTCAGTCGGTGCAGGTCCTCAACGTTTTGGTCTGAATG 1000
|||||||||||||||||||||||||| |||||||||||||||||| ||| ||| | ||||
Sbjct 1324 ACCCACCAAATTTTGGTGGTGGTTCAATCGGTGCAGGTCCTCAACATTTCGGTTTCAATG 1383
Query 1001 GAGCTACTA-TGT-TT-CCAAG-ATCGCCTTTTTCA-TCATCCCCTGCATTTAGCAACAA 1055
||| | || ||| || || || | ||||||||||| | |||||||||||||||| |||
Sbjct 1384 GAGATGCTTCTGTGTTGCCGAGCA-CGCCTTTTTCACTG-TCCCCTGCATTTAGCAGCAA 1441
Query 1056 TCCCAACACTGGATCCTATCCTTTTGCATCTCATGAATGGAGTCGCCCCACTGAACAAGG 1115
| |||| |||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1442 TACCAATACTGGATCCTATCCTTTTGCATCTCATGAATGGAGTCGCCCCACTGAACAAGG 1501
Query 1116 TAGTAGGAATCCTGGTTATGCGCCTACACATGACGGAGAAAACACATCTGGTTGGAGTTT 1175
||||| |||||||||||||||||| |||||||| || || ||| ||||||||||||||||
Sbjct 1502 TAGTATGAATCCTGGTTATGCGCCAACACATGAAGGGGATAACTCATCTGGTTGGAGTTT 1561
Query 1176 CCCCACTGAAAAAGGAAAAGGCGAA-ATCTATATTTCCATATCTGCTTCCAAACCTTATC 1234
||||||| | | ||| ||| || || |||||||||||||||||||||||||||||||
Sbjct 1562 CCCCACT-A----GCAAA-GGC-AACATTTATATTTCCATATCTGCTTCCAAACCTTATC 1614
Query 1235 TACATAAAAGCCATGAAGAACTAAGGTGGGAAGATTACAAGCAAGGAGTCAAAGGTGGGT 1294
|||| ||||||||||||||||||||||||||||||||||||||||||| ||||||||||
Sbjct 1615 TACACAAAAGCCATGAAGAACTAAGGTGGGAAGATTACAAGCAAGGAGACAAAGGTGGGC 1674
Query 1295 CGTTTCCTGCTGCTCCTGCATCTCCCATAGGCTCAAGGCCAAACTTTGCTTTTCCACCAT 1354
||||||||||||||||||| ||| |||||||||||||||||||| ||||||| |||||
Sbjct 1675 CGTTTCCTGCTGCTCCTGCTTCTACCATAGGCTCAAGGCCAAACGCTGCTTTTTCACCA- 1733
Query 1355 TAAATAGACCCCATGAAACCGCAACTATTTCTCCCCCAGCACATGGATGCACTGCATGTG 1414
|| | | | | | || ||||||||||||||||||||||||||||||
Sbjct 1734 --------CC--A--A--CTG----T-TT-CTCCCCCAGCACATGGATGCACTGCATGTG 1773
Query 1415 GAGCAACGAGTAGCTCCTCTGCTTCTGGTCATTTCACCTTTAATGGTACCACAACTCCTC 1474
||||||| |||||||| |||||||| |||| ||||||||||||||| |||||| |||||
Sbjct 1774 GAGCAACCAGTAGCTCATCTGCTTCCCGTCACTTCACCTTTAATGGTGCCACAAGTCCTC 1833
Query 1475 CATCAGCTGCTACAACTCCTCCCGGGTTGTTCTTTCCTACCAGTGGTTTTGGC-CCTATG 1533
||||||||||||||||||||||||||||||||||||||| || |||||||| | ||||||
Sbjct 1834 CATCAGCTGCTACAACTCCTCCCGGGTTGTTCTTTCCTAGCACTGGTTTTG-CTCCTATG 1892
Query 1534 ATGTTTGGTACAACTCTTGCTGTTCAAGGCACAACTCCAGCACTTCAAGCCTATCCTATT 1593
||||||||||||| |||||||||||||||||||| |||||||||||||||||||||| ||
Sbjct 1893 ATGTTTGGTACAAATCTTGCTGTTCAAGGCACAAGTCCAGCACTTCAAGCCTATCCTGTT 1952
Query 1594 CAAGGTTACATTCTTCTCCCGTTCGCCGCCATGAGTCTGCAGTAA 1638
|||||||||||||||||||||||||||||||||| ||||||||||
Sbjct 1953 CAAGGTTACATTCTTCTCCCGTTCGCCGCCATGACTCTGCAGTAA 1997
Score = 440 bits (238), Expect = 2e-122
Identities = 351/403 (87%), Gaps = 17/403 (4%)
Strand=Plus/Plus
Query 1 TTGTTGTTGTCTCCGTAAGCTTCTCCTTCATTCACTTGGTCGTTCCCCTGCTTCCTAC-T 59
|||||||||||||||||||||||||||||| ||||||||||| | ||||||||||||| |
Sbjct 2 TTGTTGTTGTCTCCGTAAGCTTCTCCTTCAATCACTTGGTCGGTGCCCTGCTTCCTACTT 61
Query 60 TGATAAAGTTTGAAAGATTGAAAGAAGAATGAAGATTGATTTTTCTGAGCCAGAGTGCAG 119
|||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||
Sbjct 62 TGATAAAGTTTGAAAGATCAAAAGAAGAATGAAGATTGATTTTTCTGAGCCAGAGTGCAG 121
Query 120 CAACTGTTACCAGTGTAGCAATCCAGGCATACTCAATACTCAAGAATCCGAACCGAG-AG 178
||||||||||||| || |||||||| | |||| ||||||| ||||||||||| ||| |
Sbjct 122 TAACTGTTACCAGTTTACCAATCCAGACCTACTTAATACTCCAGAATCCGAACAGAGCA- 180
Query 179 ATAG-GATTGACTCATCCATCACTTCAGTGCCGGTTAGTTCCGGACTAGTTCAAGCT-TG 236
|| | ||||| |||||||||||||||||||||||||| | |||| ||||| | || |
Sbjct 181 AT-GTGATTGGCTCATCCATCACTTCAGTGCCGGTTAATGATGGACCAGTTCCACCTCT- 238
Query 237 TGGACATGATTCTCCCGCTACAGTTTCAACCTCAACATCATCTCCAGTTCAAATCCAAGC 296
|| || ||||||| | ||| |||||||| ||||||||||||||||||||| ||
Sbjct 239 TGAACTTGATTCTGCTGCTGTTGTTTCAACTTCAACATCATCTCCAGTTCAA------GC 292
Query 297 TCTTGGACATGATTCTGCTGCTACGGTTTCAACCTCAACATCATCTCCA-TTCAAAGTT- 354
||||||||||||||||| |||||| ||||| |||||||||||||||||| |||||| ||
Sbjct 293 TCTTGGACATGATTCTGGTGCTACAGTTTCTACCTCAACATCATCTCCAGTTCAAATTTT 352
Query 355 --CTTCACCATTCAGCTTCGGATCCACTCCTGCTGCCATCACA 395
||||||||||||||||||||||| ||| |||||||||||||
Sbjct 353 TTCTTCACCATTCAGCTTCGGATCCGCTCATGCTGCCATCACA 395
Score = 130 bits (70), Expect = 5e-29
Identities = 97/110 (88%), Gaps = 2/110 (2%)
Strand=Plus/Plus
Query 399 TTCTTCGCTATTCAGCTTCGGATCCACTCCTGCTGCCATCACATCCGTTAGTTCTGGACC 458
|||||| | |||||| ||||||||| ||||||||||||||||||||||||||||||| ||
Sbjct 515 TTCTTCACCATTCAGTTTCGGATCCGCTCCTGCTGCCATCACATCCGTTAGTTCTGGTCC 574
Query 459 AGCGCAATCTCCTGCCTCAACACCTAAATTCGGGTTC-AGTACATTTGCT 507
||||||||||||||||||| |||||| ||| || || | || |||||||
Sbjct 575 AGCGCAATCTCCTGCCTCATCACCTAGATTATGGATCGA-TAGATTTGCT 623
> AT1G63530.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G63540.1);
Has 10212 Blast hits to 4024 proteins in 434
species: Archae - 1; Bacteria - 1259; Metazoa - 3608; Fungi
- 2247; Plants - 291; Viruses - 90; Other Eukaryotes - 2716
(source: NCBI BLink). | chr1:23563315-23565555 FORWARD LENGTH=1925
Length=1925
Score = 274 bits (148), Expect = 2e-72
Identities = 244/289 (84%), Gaps = 12/289 (4%)
Strand=Plus/Plus
Query 7 TTGTCTCCGTAAGCTTCTCCTTCATTCA-CTTGGTCGTT-CCCCTGCTTCCTACTTGAT- 63
||| ||| ||||||| ||| |||| | | || |||||| | || |||||||||| | |
Sbjct 26 TTGCCTCAGTAAGCTCCTCTTTCACT-ATCTCCGTCGTTGCTCC-GCTTCCTACTGGTTC 83
Query 64 AAAGTTTGAAAGATTGAAAGAAGAATGAAGATTGATTTTTCTGAGCCAGAGTGCAGCAAC 123
||||||||||||| |||||||||||| | ||||||||||||||||||||| ||| |||
Sbjct 84 AAAGTTTGAAAGA-CAAAAGAAGAATGAGGGTTGATTTTTCTGAGCCAGAGTACAGGAAC 142
Query 124 TGTTACCAGTGTAGCAATCCAGGCATACTCAATACTCAAGAATCCGAACCG-AGAGATAG 182
|||||||| ||| || || |||||||||| |||||||||||||| |||||| | | || |
Sbjct 143 TGTTACCATTGTCGCGATTCAGGCATACTTAATACTCAAGAATCTGAACCGAACA-AT-G 200
Query 183 -GATTGACTCATCCATCACTTCAGTGCCGGTTAGTTCCGGACTAGTTCAAGCT-TGTGGA 240
||||| |||||||||||||||||| |||||||||||||||| | |||||||| | ||||
Sbjct 201 TGATTGGCTCATCCATCACTTCAGTTCCGGTTAGTTCCGGACCATTTCAAGCTCT-TGGA 259
Query 241 CATGATTCTCCCGCTACAGTTTCAACCTCAACATCATCTCCAGTTCAAA 289
|||| |||| | |||| | |||||||||||||||||||||||||||||
Sbjct 260 CATGCTTCTGCTGCTATTGGTTCAACCTCAACATCATCTCCAGTTCAAA 308
Score = 252 bits (136), Expect = 9e-66
Identities = 235/279 (84%), Gaps = 21/279 (8%)
Strand=Plus/Plus
Query 1360 AGACCCCATGAAACCGCAACTATTTCTCCCCCAGCACATGGATGCACTGCATGTGGAGCA 1419
|||||||||||||| | || ||||| |||||||| ||||||||| || |||||||||
Sbjct 1349 AGACCCCATGAAACAGGTGCTGTTTCTTCCCCAGCATTTGGATGCACAGCCTGTGGAGCA 1408
Query 1420 ACGAGTAGCTCCTCTGCTTCTGGTCATTTCACCTTTAATGGTACCACAACTCCTCCATCA 1479
|| ||||||||||||||||||| ||| |||||||||||||| |
Sbjct 1409 ACAAGTAGCTCCTCTGCTTCTGATCACTTCACCTTTAATGG----------------T-- 1450
Query 1480 GCTGCTACAACTCCTCCCGGGTTGTTCTTTCCTACCAGTGGTTTTGGCCCTATGATGTTT 1539
|| | ||||||||||||||||||||||||||||||| | ||| ||| ||||||||||||
Sbjct 1451 GC--C-ACAACTCCTCCCGGGTTGTTCTTTCCTACCACTAGTTCTGGTCCTATGATGTTT 1507
Query 1540 GGTACAACTCTTGCTGTTCAAGGCACAACTCCAGCACTTCAAGCCTATCCTATTCAAGGT 1599
|| ||||||||||||| ||||||||||||||||||||||||| |||||||| ||||||||
Sbjct 1508 GGAACAACTCTTGCTGCTCAAGGCACAACTCCAGCACTTCAAACCTATCCTGTTCAAGGT 1567
Query 1600 TACATTCTTCTCCCGTTCGCCGCCATGAGTCTGCAGTAA 1638
| |||||||||||| ||||||||||||||||||||||||
Sbjct 1568 TTCATTCTTCTCCCATTCGCCGCCATGAGTCTGCAGTAA 1606
Lambda K H
1.33 0.621 1.12
Gapped
Lambda K H
1.28 0.460 0.850
Effective search space used: 80923278540
Database: TAIR10_cdna_20110103_representative_gene_model_updated
Posted date: Sep 25, 2014 6:13 PM
Number of letters in database: 51,074,197
Number of sequences in database: 33,602
Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5