BLASTN 2.2.26+ Reference: Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000), "A greedy algorithm for aligning DNA sequences", J Comput Biol 2000; 7(1-2):203-14. Database: TAIR10_cdna_20110103_representative_gene_model_updated 33,602 sequences; 51,074,197 total letters Query= Ahg917054 Length=744 Score E Sequences producing significant alignments: (Bits) Value AT5G54062.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu... 568 4e-161 AT5G53742.1 | Symbols: | Protein of unknown function (DUF1278)... 363 2e-99 AT3G61198.1 | Symbols: | other RNA | chr3:22655884-22656599 RE... 211 7e-54 AT5G54075.1 | Symbols: U3D | U3D; snoRNA | chr5:21945349-219457... 189 3e-47 AT5G53902.1 | Symbols: U3B | U3B; snoRNA | chr5:21887966-218886... 183 2e-45 AT1G10522.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu... 178 7e-44 AT5G53740.1 | Symbols: | unknown protein; FUNCTIONS IN: molecu... 128 7e-29 AT5G53905.1 | Symbols: | unknown protein; CONTAINS InterPro DO... 97.1 2e-19 AT5G54070.1 | Symbols: AT-HSFA9, HSFA9 | heat shock transcripti... 58.4 1e-07 > AT5G54062.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1278 (InterPro:IPR010701); BEST Arabidopsis thaliana protein match is: Protein of unknown function (DUF1278) (TAIR:AT5G53742.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). | chr5:21938793-21939623 FORWARD LENGTH=831 Length=831 Score = 568 bits (307), Expect = 4e-161 Identities = 421/477 (88%), Gaps = 4/477 (1%) Strand=Plus/Plus Query 120 AGTTGATCTCACAAAATGTTGGTCATCACTCTTCAACGTTCAAGGTTGCAATATTGAAAT 179 |||||||||| |||||||||||||||| ||| ||||| |||| ||||||||||| ||||| Sbjct 190 AGTTGATCTCGCAAAATGTTGGTCATCGCTCCTCAACATTCACGGTTGCAATATCGAAAT 249 Query 180 CTTGAAATCTGCTTTAACGGGTAAGTTTGAAAACGTTGGATCCATC-TGCTGCAAGGCAT 238 ||| ||||||| |||||| ||||||||||||||||||||||||| | ||||||||||| | Sbjct 250 CTTTAAATCTGTTTTAACCGGTAAGTTTGAAAACGTTGGATCCA-CGTGCTGCAAGGCTT 308 Query 239 TTACGGAAGTGGATGCAAATTGTTGGCCAAAAATGTTTCCGCTAAATCCGTTATTCCCTT 298 |||| |||||||||||||| ||||||||||||||||||||| | ||||| ||||||||| Sbjct 309 TTACAGAAGTGGATGCAAAGTGTTGGCCAAAAATGTTTCCGTTGAATCCATTATTCCCTC 368 Query 299 CTCTTCTTAAGGATGGTTGTTCTCTCATCAGCGCAGCTGCACCCGCACACACGGCACCTC 358 ||||||| ||||||||||| |||| ||||| | ||| |||||| ||||||| | |||| | Sbjct 369 CTCTTCTCAAGGATGGTTGCTCTCGCATCATCTCAGGTGCACCAGCACACAAGACACCGC 428 Query 359 AATTCTCTGTCATCCCTGGTTCTTCGATCGATCTCACTAAATGTTTGTCATCACTTGTCA 418 ||||| ||||||||||||||||| ||||||||||||| |||||||||||||||||||||| Sbjct 429 AATTCCCTGTCATCCCTGGTTCTCCGATCGATCTCACAAAATGTTTGTCATCACTTGTCA 488 Query 419 ACGTTCAAGGTTGTGTAACAGAAATCCACAAATCAGTTTTCACAGGAAACTTTGGTAATG 478 ||||||||||||||||||| |||||| |||||||||||||||||||||| | || ||| | Sbjct 489 ACGTTCAAGGTTGTGTAACTGAAATCTACAAATCAGTTTTCACAGGAAAGTGTGATAACG 548 Query 479 TTGGAGCAATGTGCTGCAAAGCGTTT-TCGGCTGTGGATGCAAAATGTTGGCCACAAATG 537 | ||| ||||||||||| |||||| | ||||||||||||||||||||||||| ||||| Sbjct 549 TAGGATTTATGTGCTGCAAGGCGTTTAT-GGCTGTGGATGCAAAATGTTGGCCAAAAATG 607 Query 538 TTTCCGCTAAATCGGTTCTTTCCTTTTCTTCTCAAATCTAAATGTTCTCGCACCAAC 594 ||||| ||||||| |||||| ||| ||| |||||| | ||||||||||| |||| Sbjct 608 TTTCCACTAAATCCGTTCTTCCCTCCTCTCCTCAAAAATGTATGTTCTCGCATCAAC 664 > AT5G53742.1 | Symbols: | Protein of unknown function (DUF1278) | chr5:21813671-21814018 FORWARD LENGTH=348 Length=348 Score = 363 bits (196), Expect = 2e-99 Identities = 289/333 (87%), Gaps = 10/333 (3%) Strand=Plus/Plus Query 271 ATGTTTCCGCTAAATCCGTTATTCCCTTCTCTTCTTAAGGATGGTTGTTCTCTCATCAGC 330 ||||||||| | ||||| ||||||||| ||||||| ||||||||||| |||| ||||| | Sbjct 1 ATGTTTCCGTTGAATCCATTATTCCCTCCTCTTCTCAAGGATGGTTGCTCTCGCATCATC 60 Query 331 GC-AGCTGCACCCGCACACACGGCACCTCAATT-CTCTGTCATCCCTGGTTCTTCGATCG 388 | || ||||| |||||||| ||||||||| || |||||||||||||||| |||||| Sbjct 61 TCTGGC-GCACCAACACACACGAAACCTCAATTCCT-TGTCATCCCTGGTTCTCCGATCG 118 Query 389 ATCTCACTAAATGTTTGTCATCACTTGTCAACGTTCAAGGTTGTGTAACAGAAATCCACA 448 ||||||| |||||||||||||||||||| |||||| ||||||| ||||| |||||| ||| Sbjct 119 ATCTCACAAAATGTTTGTCATCACTTGTTAACGTTGAAGGTTGCGTAACTGAAATCTACA 178 Query 449 AATCAGTTTTCACAGGAAACTTTGGTAATGTTGGAGCA-ATGTGCTGCAAAGCGTTTTCG 507 ||||||||||||||||||| ||||||||||||||| | ||||||||||| |||||||| Sbjct 179 AATCAGTTTTCACAGGAAAGTTTGGTAATGTTGGA-TATATGTGCTGCAAGGCGTTTTCA 237 Query 508 GCTGTGGATGCAAAATGTTGGCCACAAATGTTTCCGCTAAATCGGTTCTTTCCTTTTCTT 567 |||||||||| |||||||||||||||||||||||||||||||| |||||| ||| |||| Sbjct 238 GCTGTGGATGTAAAATGTTGGCCACAAATGTTTCCGCTAAATCCGTTCTTCCCTCCTCTT 297 Query 568 CTCAA-ATCTAAATGTTCTCGC-ACCAACGCAG 598 ||||| | || ||||||| | | |||||||| Sbjct 298 CTCAAGAAGGAA-TGTTCTC-CTATCAACGCAG 328 > AT3G61198.1 | Symbols: | other RNA | chr3:22655884-22656599 REVERSE LENGTH=622 Length=622 Score = 211 bits (114), Expect = 7e-54 Identities = 128/135 (95%), Gaps = 0/135 (0%) Strand=Plus/Minus Query 610 ACCGCCGTGCAACCACCCCGGCAAGGTAGAACTGAGCCGAGTAACGATCCTCTACAGCAC 669 |||||||||||||||||||||||||||||||| ||||||||||||||||||||||||||| Sbjct 191 ACCGCCGTGCAACCACCCCGGCAAGGTAGAACCGAGCCGAGTAACGATCCTCTACAGCAC 132 Query 670 CAGATGCATCCGAAGACAATCGTGGCCGTTAATCACGCTCTGATCGCACGGTCATGGTTC 729 ||||||| ||| ||||| ||||| ||||| ||||||||||||||||||||| |||||||| Sbjct 131 CAGATGCGTCCTAAGACGATCGTAGCCGTCAATCACGCTCTGATCGCACGGCCATGGTTC 72 Query 730 ATCAACCAGGGTTAA 744 ||||||||||||||| Sbjct 71 ATCAACCAGGGTTAA 57 > AT5G54075.1 | Symbols: U3D | U3D; snoRNA | chr5:21945349-21945709 REVERSE LENGTH=361 Length=361 Score = 189 bits (102), Expect = 3e-47 Identities = 125/136 (92%), Gaps = 2/136 (1%) Strand=Plus/Minus Query 610 ACCGCCGTGCAACCACCCCGGCAAGGTAGAACTGAGCCGAGTAACGATCCTCTACAGCAC 669 |||||||||| ||||||||||||||||||||| ||||||||||||||||||||||||||| Sbjct 310 ACCGCCGTGCGACCACCCCGGCAAGGTAGAACCGAGCCGAGTAACGATCCTCTACAGCAC 251 Query 670 CAGATGCATCCGAAGACAATCGTGGCCGTTAATCACGCTCTGA-TCGCACGGTCATGGTT 728 | ||||| ||||| ||| ||||| ||||| ||||||||||| | ||||||||||||||| Sbjct 250 CGGATGCGTCCGAGGACGATCGTAGCCGTCAATCACGCTCT-AGCCGCACGGTCATGGTT 192 Query 729 CATCAACCAGGGTTAA 744 |||||||||||||||| Sbjct 191 CATCAACCAGGGTTAA 176 > AT5G53902.1 | Symbols: U3B | U3B; snoRNA | chr5:21887966-21888667 FORWARD LENGTH=702 Length=702 Score = 183 bits (99), Expect = 2e-45 Identities = 125/137 (91%), Gaps = 3/137 (2%) Strand=Plus/Minus Query 610 ACCGCCGTGCAACCACCCCGGCAAGGTAG-AACTGAGCCGAGTAACGATCCTCTACAGCA 668 |||||||||| |||||||||||||||||| ||| |||||||||||||||||||||||||| Sbjct 535 ACCGCCGTGCGACCACCCCGGCAAGGTAGAAACCGAGCCGAGTAACGATCCTCTACAGCA 476 Query 669 CCAGATGCATCCGAAGACAATCGTGGCCGTTAATCACGCTCTGA-TCGCACGGTCATGGT 727 || ||||| ||||| ||| ||||| ||||| ||||||||||| | |||||||||||||| Sbjct 475 CCGGATGCGTCCGAGGACGATCGTAGCCGTCAATCACGCTCT-AGCCGCACGGTCATGGT 417 Query 728 TCATCAACCAGGGTTAA 744 ||||||||||||||||| Sbjct 416 TCATCAACCAGGGTTAA 400 > AT1G10522.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; Has 24 Blast hits to 24 proteins in 8 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 24; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | chr1:3469636-3471488 FORWARD LENGTH=980 Length=980 Score = 178 bits (96), Expect = 7e-44 Identities = 121/133 (91%), Gaps = 1/133 (1%) Strand=Plus/Minus Query 610 ACCGCCGTGCAACCACCCCGGCAAGGTAGAACTGAGCCGAGTAACGATCCTCTACAGCAC 669 |||||||||| ||||||||||||||| || || |||||||||||||||||||||||||| Sbjct 195 ACCGCCGTGCGACCACCCCGGCAAGGAAG-ACCAAGCCGAGTAACGATCCTCTACAGCAC 137 Query 670 CAGATGCATCCGAAGACAATCGTGGCCGTTAATCACGCTCTGATCGCACGGTCATGGTTC 729 | ||||| ||||||||| ||||| ||| | ||||||||||||| |||||||||||||||| Sbjct 136 CGGATGCGTCCGAAGACGATCGTAGCCATCAATCACGCTCTGACCGCACGGTCATGGTTC 77 Query 730 ATCAACCAGGGTT 742 ||||||||||||| Sbjct 76 ATCAACCAGGGTT 64 > AT5G53740.1 | Symbols: | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: synergid; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G53905.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). | chr5:21811633-21814908 FORWARD LENGTH=828 Length=828 Score = 128 bits (69), Expect = 7e-29 Identities = 106/124 (85%), Gaps = 2/124 (2%) Strand=Plus/Plus Query 72 AGCCGAGGTG-CTTGTGACACCTCGATTTCCTTCTATTTCTGCTTTTCCAGTTGATCTCA 130 |||||||| | || |||||||| | ||||| ||||||| ||| || |||| ||||||||| Sbjct 420 AGCCGAGG-GACTCGTGACACCGCAATTTCTTTCTATTCCTGGTTCTCCAATTGATCTCA 478 Query 131 CAAAATGTTGGTCATCACTCTTCAACGTTCAAGGTTGCAATATTGAAATCTTGAAATCTG 190 |||||||||||||||||||| ||||| |||||||||| || || || ||||| ||||||| Sbjct 479 CAAAATGTTGGTCATCACTCCTCAACATTCAAGGTTGTAAAATCGAGATCTTTAAATCTG 538 Query 191 CTTT 194 ||| Sbjct 539 TTTT 542 Score = 99.0 bits (53), Expect = 6e-20 Identities = 70/78 (90%), Gaps = 2/78 (3%) Strand=Plus/Plus Query 666 GCACCAGATGCATCCGAAGACAATCGTGGCCGTTAATCACGCTCTGA-TCGCACGGTCAT 724 ||||| ||||| ||||| ||||||||| ||||| ||||||||||| | ||||||||||| Sbjct 31 GCACCGGATGCGTCCGAGGACAATCGTAGCCGTCAATCACGCTCT-AGCCGCACGGTCAT 89 Query 725 GGTTCATCAACCAGGGTT 742 |||||||||||||||||| Sbjct 90 GGTTCATCAACCAGGGTT 107 > AT5G53905.1 | Symbols: | unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1278 (InterPro:IPR010701); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G54062.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). | chr5:21888148-21889254 REVERSE LENGTH=507 Length=507 Score = 97.1 bits (52), Expect = 2e-19 Identities = 71/80 (89%), Gaps = 2/80 (3%) Strand=Plus/Plus Query 666 GCACCAGATGCATCCGAAGACAATCGTGGCCGTTAATCACGCTCTGA-TCGCACGGTCAT 724 ||||| ||||| ||||| ||| ||||| ||||| ||||||||||| | ||||||||||| Sbjct 307 GCACCGGATGCGTCCGAGGACGATCGTAGCCGTCAATCACGCTCT-AGCCGCACGGTCAT 365 Query 725 GGTTCATCAACCAGGGTTAA 744 |||||||||||||||||||| Sbjct 366 GGTTCATCAACCAGGGTTAA 385 > AT5G54070.1 | Symbols: AT-HSFA9, HSFA9 | heat shock transcription factor A9 | chr5:21943983-21945651 FORWARD LENGTH=1512 Length=1512 Score = 58.4 bits (31), Expect = 1e-07 Identities = 31/31 (100%), Gaps = 0/31 (0%) Strand=Plus/Plus Query 714 CGCACGGTCATGGTTCATCAACCAGGGTTAA 744 ||||||||||||||||||||||||||||||| Sbjct 1365 CGCACGGTCATGGTTCATCAACCAGGGTTAA 1395 Lambda K H 1.33 0.621 1.12 Gapped Lambda K H 1.28 0.460 0.850 Effective search space used: 36118351693 Database: TAIR10_cdna_20110103_representative_gene_model_updated Posted date: Sep 25, 2014 6:13 PM Number of letters in database: 51,074,197 Number of sequences in database: 33,602 Matrix: blastn matrix 1 -2 Gap Penalties: Existence: 0, Extension: 2.5