BLASTN 2.2.26+ Reference: Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb Miller (2000), "A greedy algorithm for aligning DNA sequences", J Comput Biol 2000; 7(1-2):203-14. Database: TAIR10_cdna_20110103_representative_gene_model_updated 33,602 sequences; 51,074,197 total letters Query= Ahg907972 Length=327 Score E Sequences producing significant alignments: (Bits) Value AT4G01590.1 | Symbols: | unknown protein; BEST Arabidopsis tha... 468 2e-131 AT4G35680.1 | Symbols: | Arabidopsis protein of unknown functi... 387 5e-107 > AT4G01590.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana protein match is: Arabidopsis protein of unknown function (DUF241) (TAIR:AT4G35680.1); Has 1908 Blast hits to 1345 proteins in 175 species: Archae - 3; Bacteria - 106; Metazoa - 494; Fungi - 346; Plants - 115; Viruses - 71; Other Eukaryotes - 773 (source: NCBI BLink). | chr4:688881-690415 REVERSE LENGTH=1022 Length=1022 Score = 468 bits (253), Expect = 2e-131 Identities = 307/332 (92%), Gaps = 8/332 (2%) Strand=Plus/Plus Query 1 ATGGATCTTTCTTTGATTTTCTTCTTCTTAGACCCTGATAACTTCCCTAAACAACTTCTT 60 ||||||||||||||||||||||| |||||||| |||||||||||||||||| |||||||| Sbjct 498 ATGGATCTTTCTTTGATTTTCTTGTTCTTAGA-CCTGATAACTTCCCTAAAGAACTTCTT 556 Query 61 GGAGAAACTCGAAGAG---AACGGCATGTTAAGAGAGCTAAATGGAGTCAAGAAGCTGAT 117 ||||| |||||||||| |||||| ||| |||||||||||||||||||||||||||||| Sbjct 557 GGAGATACTCGAAGAGAACAACGGCCTGTGAAGAGAGCTAAATGGAGTCAAGAAGCTGAT 616 Query 118 TTGCAGAAATTGGATGTGTTTGAGAAGCTTGAAGCTAAGTCTAATGCtgaaggtaaggaa 177 |||||||||||||||||||||||||||||||||||||||| ||| | |||||| |||||| Sbjct 617 TTGCAGAAATTGGATGTGTTTGAGAAGCTTGAAGCTAAGTTTAAGGTTGAAGGCAAGGAA 676 Query 178 gagaaagaagaaggagaagatgatgaagaagttgaggaatcagaaggagaagaaTCTGAT 237 |||||||||||||| ||||||||||||||||||| ||||||||||||||||||||||||| Sbjct 677 GAGAAAGAAGAAGGGGAAGATGATGAAGAAGTTGTGGAATCAGAAGGAGAAGAATCTGAT 736 Query 238 AACAGAGATTATGATCAGAATCAAGACTTTGATGATGATAATGACGATTATAATCAA-GC 296 ||| ||||||||||||||||||||||||||||||||||| | |||||||||||| || | Sbjct 737 AACGGAGATTATGATCAGAATCAAGACTTTGATGATGATGACGACGATTATAAT-AACGA 795 Query 297 GGATGATGG-TGATTTTGAAGAGGTGTATTAA 327 ||||||||| | | || ||||||||||||||| Sbjct 796 GGATGATGGATTAGTT-GAAGAGGTGTATTAA 826 > AT4G35680.1 | Symbols: | Arabidopsis protein of unknown function (DUF241) | chr4:16917749-16920008 FORWARD LENGTH=1960 Length=1960 Score = 387 bits (209), Expect = 5e-107 Identities = 275/305 (90%), Gaps = 12/305 (4%) Strand=Plus/Plus Query 30 AGACCCTGATAACTTCCCTAAACAACTTCTTGGAGAAACTCGAAGAG---AACGGCATGT 86 ||| |||||||||||| ||||| ||||| ||||||| |||||||||| |||||| ||| Sbjct 1402 AGA-CCTGATAACTTCTCTAAAGAACTTGTTGGAGATACTCGAAGAGAACAACGGCCTGT 1460 Query 87 TAAGAGAGCTAAATGGAGTCAAGAAGCTGATTTGCAGAAATTGGATGTGTTTGAGAAGCT 146 ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| Sbjct 1461 GAAGAGAGCTAAATGGAGTCAAGAAGCTGATTTGCAGAAATTGGATGTGTTTGAGAAGCT 1520 Query 147 TGAAG-CTAAGTCTAATG-Ctgaaggtaaggaagagaaagaagaaggagaagatgatgaa 204 ||| | |||||| ||| | || |||| || |||||||||||||| ||||||||||||||| Sbjct 1521 TGA-GTCTAAGTTTAA-GACTCAAGGCAATGAAGAGAAAGAAGACGGAGAAGATGATGAA 1578 Query 205 gaagttgaggaatcagaaggagaagaaTCTGATAACAGAGATTATGATCAGAATCAAGAC 264 |||||| ||||||||||||||||||||| |||||| ||||||||||||||||||||||| Sbjct 1579 CAAGTTGTGGAATCAGAAGGAGAAGAATCAGATAACGGAGATTATGATCAGAATCAAGAC 1638 Query 265 TTTGATGATGAT-AATGACGATTATAATCAAGCGGATGATGGTG-ATTTTGAAGAGGTGT 322 |||||||||||| || |||||||||||||| | ||| ||||||| |||| |||||||||| Sbjct 1639 TTTGATGATGATGAA-GACGATTATAATCATGAGGAGGATGGTGGATTT-GAAGAGGTGT 1696 Query 323 ATTAA 327 ||||| Sbjct 1697 ATTAA 1701 Lambda K H 1.33 0.621 1.12 Gapped Lambda K H 1.28 0.460 0.850 Effective search space used: 15231127947 Database: TAIR10_cdna_20110103_representative_gene_model_updated Posted date: Sep 25, 2014 6:13 PM Number of letters in database: 51,074,197 Number of sequences in database: 33,602 Matrix: blastn matrix 1 -2 Gap Penalties: Existence: 0, Extension: 2.5