Position
| Chromosome | chr17 |
| Start | 7568900 |
| End | 7608908 |
| Strand | + |
View in GBrowse: chr17:7568900..7608908
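As a quick check on the coordinates above, the genomic span of the locus can be computed directly. This is a minimal sketch: it assumes the start and end positions are 1-based and inclusive, which the record itself does not state, and `locus_span` is a hypothetical helper name introduced only for illustration.

```python
def locus_span(start: int, end: int) -> int:
    """Length in bp of a locus, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Position table.
span = locus_span(7568900, 7608908)
print(span)  # 40009
```

Under that assumption the locus covers 40009 bp, so the 5106 bp ORF is considerably shorter than the genomic interval it maps to.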
Most similar sequence in NCBI nr database
| Accession | Description | E-value | Score |
|---|---|---|---|
| XP_028043810.1 | ATP-binding cassette sub-family A member 3-like isoform X2 | 0.0 | 3398 |
ORF sequence (5106 bp)
ATGATGGCCTTAGAAAAATTACAACTACTTATTTGGAAAAATGTCCTTCTACAATACAGACGCAAATGGCAGACATTATTTGAAATAGCAACTCCTATATTTTTTTGCTTTTTTCTAATACTTATGAGATATTTAGTGCCGCCCAAAGCAATACCGGCTAAAACTTACACATCCTTTGATGTCACCTATTTCAATAAGACAAGATTGCTTAGGGGAAATCTGTCATCAGCTGACATGGTATTGGCATACTCACCAATAAATCAATTGACTGAAAAAGTTGCAATCAATGCTATGGCTGAACTTGCTCGAGATTCAATTGATCCTGATATATTTTTTTTTATATCACTTGCAATTTCATACACTCCTAAAGGATATAAAAATGCCAAAGATATGGAAGCAGCACTAATACAGCCCAATGCTATGAATAGCATTTTGCTTGGTATACAATTTGATGATGAAATCGCTAATGCTACGTCTTGGCCAGATGATATAAAAATAACTTTTAGGTTTCCTGCTGTTATGAGATCGACAACGTCTGATCATCAAATGCAATTGAGCTGGCAAACAAACTTACTATTTCCATTGTTTCCGAATTCTGGTCCACGAGAACCAGATGATCAATATGGAGGAACTTCGCCAGGTTACTTCTCGGAACTATTCTTGTTCATGCAGCACGCAATTTCCAAAAGCATCGTAAAAGAGAAGACCGGGAAAAACATTGATACCAAAATATATCTACAGCGTTTACCTCAACTTGAATCTAGGGTGGATCAGCTACTGTTCATTTTACAAAGATTTGTATCCATAGCTATAATGCTGGGCTTCGTCTATACATTCGTGAATACAGTCAGGGCTGTCACAACTGAGAAAGAACTGCAGCTAAAGGAAACAATGGCAATTATGGGTCTACCGTCTTGGCTGCACTGGTTAGCGTGGTTCATCAAACAATTTTCATTCTTACTTGTGACTATTGTCATAATGGTTATATTGTTTAAGATTCCGTTATCCGAAACTGAAACAGGAGTGTCGTACTCCGTGTTCACCTACACGCCGTGGTCGGTCCTTTTATTCTTTTTGATTCTATTCGTCATTGCATCATTGACATTCTCTTTTATGATAAGCGTTTTCTTCACTAGAGCAAACACCGCCGCGTCTTTTATGGCTCTCATATGGTTCGCAACGTTCGCAGTCTTTATGTTCACCCAAATGATTAACGAGACCATGTCTGCCCCAGTTAAAATAACACTATGTTTGTTATCAAACACTGCCATTGGATACGCATTCCAGATGATTATCATTGCCGAAGGAACGATGCAAGGTTTACAATGGGCCAATTTCTTTGAGCCGATATCGTACAGTGAAAACTTTCAACCGGCTCATGTAGCCTTCATGCTAATATTGGACTCCGTGATGTACATGGCGATCGCCGTGTATGTGGAGAACATACGACCCGGAATGTACGGAGTGCCGTTGCCTTGGTACTATCCTTTCACTGTTAGTTATTGGTGGCCGAACAGAATATCTGACACAAATCGAGAGAAAACATCTGACGATCTTGAATATAATGATGCATTATTGTCGGTGGTACACGACGAGGAACCCAAAGGCGTTCCCATTGGCGTAAACATTCAAAATCTCACAAAAAGATATAAGGGACGGGGGAAAGCTGTCGATAATTTAAATCTTAGACTTTATGAAAACGAGATTACAGTACTTCTGGGTCACAATGGAGCGGGGAAAACGACAACGATTTCTATGTTAACAGGAATGGTCCCTCCAACATCAGGATCGGTAACGATCAATGGATATGATATCGTGACGGAAACGGAAAAAGCTAGACGTTCTCTTGGAATATGTCCTCAACATAATGTACTGTTCCCTGACCTGACCGTAGCTGAACATTTAATTTTTTATTCAAAGCTAAAGGGGATTCCCGAATCAGAAATCAATGAAGAAATAGATCATTTCGTCAAACTTCTCGAATTGGATGACAAGAGGCA
CGCGGCGGCATCGAGTCTGTCGGGCGGGCAGAAGCGGCGGCTGTCGGCGGGCTGCGCGCTGTGCGGGCGCTCGCGCGTGGTGCTGCTGGACGAGCCCACGTCGGGCCTGGACCCGGCGGCGCGGCGCGCGCTGTGGGACCTGCTGCAGCGCGAGAAGCGCGGCCGCACCGTGCTGCTGACCACGCACTTCATGGACGAGGCGGACGTGCTGGCCGACCGCGTGGCCGTGCTCGCCGCCGGCCGCCTGGCCTGTCTGGGCTCCCCCTACTTCCTCAAGCGCCACTACGGACTCGGCTACAAGCTCGCGCTCGTCAAGGACGCCGCCTGCCAAGTCGATCTTGTCACGGAATTCTTCAAAACATATGTCCCTAATCTCAAGCAAAATTCAAATATCGGCTCAGAATTAACTTACATTTTACCTAGTGAGAGTGTAAGTAAATTTCCCGAAATGCTCAAAAAACTTGAAGAAAAGAAAGAGTCTTTATGTATATCAAGCTACGGGTTATCAGTGACTAGCCTTGAGGAAGTTTTTATGAAAGCTGGAATCGAAGATAACAATGTTGAAATAAAAGAAACTAAAGGAGATATCGAAATGATTGATATGAATGGGGACATGTTGAATAAATATTTTATTTCAGAGGACAATGAGCCTTTACATAAAACGCAAGGGTTTCATTTACTGAAGAATCACATAAAAGCTATGTTCTTGAAACTGATGTACAACACATTGAGAAATAAGGCACTCGCTGCGATACAAATAATATGGCCAATAATAAACATTATTTTATCGATGATAGTTTCACTATCTTGGAAATTCTTGAATGTATTGCCCCCGCTTGAACTAAGCCTGGAGAGTGGATTTAAAGGAACCGAGACATTAGTATCGCAAGGTAACGATTTGAGAGACGGCAGTACTGAAGCTAATGTCATGATGGCATACAAAGATTATTTCAAACGATCAACGTACCCCGGCTTGAAGTTATTGGATGTCGGAACTTCTAACTTAAAAAACGTCTATTTGAAACTTATCGCAGAAGATCAATCTCGAGTACGATATGAAGATCTAGTGGGAGCTACATTTCGTAATAACAGCATAACAGCCTGGTTCAGTAACTACGGTCTTCATGATTCTGCTATCTCGCTTTCACTAGTCGAGAACGCAATAATTCGTTCTCTGTCACCCAATACAACTTTGACATTTGTCAACCATCCGCTACCATATTCAGTTGAAGGAATGGTTCAAGTAATGTCTACCGGAACAAACACCGCCTTCATGTTTTCTTTTAGCCTGGGATTTTGTATAGCGGTTATAAGCTCTTTTCTGGTTCTCTTTGTAATTAAAGAGCGCATCAGCGGTGCGAAACTTCTCCAAAGAGTATCAGGAGTGCGACCGGTAGTAATGTGGAGTACTGCCCTCATTTGGGATTGGATTTGGTTGTTTCTGAACCACATTTGCATTATAGTCACTATTGCGTGCTTCCAAGAAATGGGAATGTCGACGCCTGCTGAACTTGGTCGAATTTTATTAGTTCTGATGGTGTTTTCATTGGCAATTATACCGTTGCACTACCTCGCATCGTTCTGTTTCGAAGAAGCCGCCACCGGTTTCAGTAAAATGGTGTTTGTAAATATATTTTGTGGTTCAATGTTGTTCCTTGTCACTGAAGTATTACGGATGCCTTTCATAAATGCTGCTGCTTACGCAGAAATACTTGAGTATCCATTTTCATTGTTACCAATCTACTGTGTCAGCAAGAGTGTCAGGGAAATGGTGACATCTTCAATAAAGATTAAAGCCTGCGACAGCTTATGCAACCAATTAAATTATAAAAATTGCACACGACTAACTATATGCAATGAACTAGACATATCCATGTGTTGTATTGAGGATAATCCATTTTTAGGGTGGAAGGAACCGGGTATTGCAAGATATCTATTTACTATGATAGTTGTAGCGACTGTGTCATTTGCAATATTGCTAGCCAAGGAATACGAACTTT
GGAACAAGACTATGATGTTATCTGGTACAAAACCAAAATCTAATGAGAGTAAAAAGGTTGAAGTAAATGCAGAAGTTGAAGATGATGATGTTGTGGAGGAAAAACAGCGTGTTCTAGCAATGACAAGTAGTGAGGTCACCGCACACAGCCTCGTGTGTCGCGAGCTGAGCAAGCGCTACCGGCGCCTCGTAGCCGTCGACCGACTCACGTTCGCGGTGCGCGGCGGAGAGTGCTTCGGCCTGCTCGGAGTCAACGGCGCCGGCAAGACCAGCACCTTCCGCATGCTGACCGGCGACGCACGCGTGTCGGACGGCGACGCGCTCGTGCACGGACACTCCGTGCGAGCACACGTGCAGGACGTGCACCGCCTCATTGGTTACTGCCCCCAATTCGATGCACTGTTTGACAATTTAACCGCAAGGGAGATATTGAAGATTTTCTGTTTGCTGCGTGGCATTCCTACGTCAATAGGCGAAACTCATGCCATTCATCTTGCTAAACAATTGGGATTCATAAAGCACTATGACAAGAAGGTTCGGGAATGTAGTGGTGGAACAAAACGTAAAATCAGTACAGCGGTCGCGTTGCTCGGTGATTACCCAGTTATATTCCTGGATGAGCCTACGACAGGCATGGATCCGGCGTCGAAGCGGCTCGTGTGGCGCGGCATCAGCAGCGCGGTGGGCGGCGGGCGCAGCGTGGTGCTGACGTCACACAGCATGGAGGAGTGCGAGGCTCTCTGCTCCAAGCTCACCGTCATGGTCAACGGCAGGCTCTGCTGTCTCGGCTCGCTGCAACATCTCAAGAGCAAATTCTCACAGGGATACACAATAATCGTGAAATGTAAATCGGGTCCAAATCGAGACGCAGCAGTGCTAGACGTCCACAACTATATGACTACAAATTTTGTTGGTGCTAACCTCATCGAGACGTACCTGGGCATGAGCACGTACCACGTGTCGTCGGCGGGGCTGCCGTGGTGGCGCGTGTTCAGTGCGCTCGAACTAGCGCGGGACTCGCTGCCGCTTGATGACTACTCGGTCGCGCAGACAACACTCGAGCAAGTTTTCCTCGCATTTACAAAGCTCCAACGTCCTATAAATTAA
Protein sequence (1701 aa)
MMALEKLQLLIWKNVLLQYRRKWQTLFEIATPIFFCFFLILMRYLVPPKAIPAKTYTSFDVTYFNKTRLLRGNLSSADMVLAYSPINQLTEKVAINAMAELARDSIDPDIFFFISLAISYTPKGYKNAKDMEAALIQPNAMNSILLGIQFDDEIANATSWPDDIKITFRFPAVMRSTTSDHQMQLSWQTNLLFPLFPNSGPREPDDQYGGTSPGYFSELFLFMQHAISKSIVKEKTGKNIDTKIYLQRLPQLESRVDQLLFILQRFVSIAIMLGFVYTFVNTVRAVTTEKELQLKETMAIMGLPSWLHWLAWFIKQFSFLLVTIVIMVILFKIPLSETETGVSYSVFTYTPWSVLLFFLILFVIASLTFSFMISVFFTRANTAASFMALIWFATFAVFMFTQMINETMSAPVKITLCLLSNTAIGYAFQMIIIAEGTMQGLQWANFFEPISYSENFQPAHVAFMLILDSVMYMAIAVYVENIRPGMYGVPLPWYYPFTVSYWWPNRISDTNREKTSDDLEYNDALLSVVHDEEPKGVPIGVNIQNLTKRYKGRGKAVDNLNLRLYENEITVLLGHNGAGKTTTISMLTGMVPPTSGSVTINGYDIVTETEKARRSLGICPQHNVLFPDLTVAEHLIFYSKLKGIPESEINEEIDHFVKLLELDDKRHAAASSLSGGQKRRLSAGCALCGRSRVVLLDEPTSGLDPAARRALWDLLQREKRGRTVLLTTHFMDEADVLADRVAVLAAGRLACLGSPYFLKRHYGLGYKLALVKDAACQVDLVTEFFKTYVPNLKQNSNIGSELTYILPSESVSKFPEMLKKLEEKKESLCISSYGLSVTSLEEVFMKAGIEDNNVEIKETKGDIEMIDMNGDMLNKYFISEDNEPLHKTQGFHLLKNHIKAMFLKLMYNTLRNKALAAIQIIWPIINIILSMIVSLSWKFLNVLPPLELSLESGFKGTETLVSQGNDLRDGSTEANVMMAYKDYFKRSTYPGLKLLDVGTSNLKNVYLKLIAEDQSRVRYEDLVGATFRNNSITAWFSNYGLHDSAISLSLVENAIIRSLSPNTTLTFVNHPLPYSVEGMVQVMSTGTNTAFMFSFSLGFCIAVISSFLVLFVIKERISGAKLLQRVSGVRPVVMWSTALIWDWIWLFLNHICIIVTIACFQEMGMSTPAELGRILLVLMVFSLAIIPLHYLASFCFEEAATGFSKMVFVNIFCGSMLFLVTEVLRMPFINAAAYAEILEYPFSLLPIYCVSKSVREMVTSSIKIKACDSLCNQLNYKNCTRLTICNELDISMCCIEDNPFLGWKEPGIARYLFTMIVVATVSFAILLAKEYELWNKTMMLSGTKPKSNESKKVEVNAEVEDDDVVEEKQRVLAMTSSEVTAHSLVCRELSKRYRRLVAVDRLTFAVRGGECFGLLGVNGAGKTSTFRMLTGDARVSDGDALVHGHSVRAHVQDVHRLIGYCPQFDALFDNLTAREILKIFCLLRGIPTSIGETHAIHLAKQLGFIKHYDKKVRECSGGTKRKISTAVALLGDYPVIFLDEPTTGMDPASKRLVWRGISSAVGGGRSVVLTSHSMEECEALCSKLTVMVNGRLCCLGSLQHLKSKFSQGYTIIVKCKSGPNRDAAVLDVHNYMTTNFVGANLIETYLGMSTYHVSSAGLPWWRVFSALELARDSLPLDDYSVAQTTLEQVFLAFTKLQRPIN
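The two stated lengths are consistent with each other: 5106 bp corresponds to 1702 codons, that is, 1701 residues plus one stop codon. The sketch below illustrates this with a small translation routine. It assumes the standard genetic code (NCBI translation table 1), and it translates only the first ten codons and the last five codons copied from the ORF above rather than the full sequence. `translate` is a hypothetical helper name, not part of any tool used to build this record.

```python
from itertools import product

# Standard genetic code (ASSUMED: NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 nt and last 15 nt of the ORF sequence shown above.
orf_start = "ATGATGGCCTTAGAAAAATTACAACTACTT"
orf_end = "CGTCCTATAAATTAA"

print(translate(orf_start))  # MMALEKLQLL
print(translate(orf_end))    # RPIN*

# Length bookkeeping: 1701 residues plus one stop codon.
print((1701 + 1) * 3)  # 5106
```

The translated fragments match the first ten and last four residues of the protein sequence listed above.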
Domains and motifs
| Database | ID | Description | Start | End | E-value | InterPro ID |
|---|---|---|---|---|---|---|
| PANTHER | PTHR19229 | - | 7 | 1690 | 0.0 | IPR026082 |
| Pfam | PF12698 | ABC-2 family transporter protein | 24 | 477 | 3.2e-17 | - |
| SUPERFAMILY | SSF52540 | - | 527 | 772 | 1.5e-61 | IPR027417 |
| Gene3D | 3.40.50.300 | - | 538 | 772 | 8.8e-71 | - |
| ProSiteProfiles | PS50893 | ATP-binding cassette, ABC transporter-type domain profile. | 541 | 771 | 21.349 | IPR003439 |
| CDD | cd03263 | ABC_subfamily_A | 543 | 759 | 5.4e-113 | - |
| Pfam | PF00005 | ABC transporter | 557 | 701 | 5.8e-31 | IPR003439 |
| SMART | SM00382 | - | 566 | 748 | 7.9e-09 | IPR003593 |
| Pfam | PF12698 | ABC-2 family transporter protein | 989 | 1327 | 2.7e-30 | - |
| Gene3D | 3.40.50.300 | - | 1372 | 1616 | 2.1e-55 | - |
| CDD | cd03263 | ABC_subfamily_A | 1384 | 1602 | 1.2e-97 | - |
| ProSiteProfiles | PS50893 | ATP-binding cassette, ABC transporter-type domain profile. | 1384 | 1614 | 16.54 | IPR003439 |
| SUPERFAMILY | SSF52540 | - | 1384 | 1605 | 2.0e-48 | IPR027417 |
| Pfam | PF00005 | ABC transporter | 1400 | 1542 | 8.5e-20 | IPR003439 |
| SMART | SM00382 | - | 1408 | 1593 | 1.1e-03 | IPR003593 |
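The Pfam rows in the table suggest a tandem arrangement: each half of the protein carries an ABC-2 family transporter region (PF12698) followed by an ABC transporter domain (PF00005). The sketch below is one way to read that order off the table. The hit coordinates are copied from the rows above; the helper names `architecture` and `overlaps` are hypothetical and used only for illustration.

```python
# Pfam hits copied from the domain table: (Pfam ID, start, end), in residues.
pfam_hits = [
    ("PF12698", 24, 477),
    ("PF00005", 557, 701),
    ("PF12698", 989, 1327),
    ("PF00005", 1400, 1542),
]
PROTEIN_LENGTH = 1701  # from the "Protein sequence (1701 aa)" heading


def architecture(hits):
    """Return Pfam IDs ordered by start position along the protein."""
    return [pfam_id for pfam_id, start, end in sorted(hits, key=lambda h: h[1])]


def overlaps(a, b):
    """True if two (id, start, end) hits share at least one residue."""
    return a[1] <= b[2] and b[1] <= a[2]


order = architecture(pfam_hits)
print(order)  # ['PF12698', 'PF00005', 'PF12698', 'PF00005']

# Check that no two Pfam hits overlap and that all lie within the protein.
any_overlap = any(
    overlaps(a, b) for i, a in enumerate(pfam_hits) for b in pfam_hits[i + 1:]
)
print(any_overlap)  # False
print(all(1 <= s <= e <= PROTEIN_LENGTH for _, s, e in pfam_hits))  # True
```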
InterPro assignment
| InterPro ID | InterPro description |
|---|---|
| IPR003439 | ABC transporter-like |
| IPR003593 | AAA+ ATPase domain |
| IPR026082 | ABC transporter A |
| IPR027417 | P-loop containing nucleoside triphosphate hydrolase |
Gene ontology (GO) assignment
| GO category | GO ID | GO description |
|---|---|---|
| molecular function | GO:0005524 | ATP binding |
| cellular component | GO:0016021 | integral component of membrane |
| molecular function | GO:0016887 | ATPase activity |
| molecular function | GO:0042626 | ATPase-coupled transmembrane transporter activity |
| biological process | GO:0055085 | transmembrane transport |
Homologs in other species
| Species | Accession |
|---|---|
| Danaus plexippus | DPOGS200378 |
| Heliconius melpomene | HMEL005382g1.t1 |
| Manduca sexta | XP_030022238.1 |
| Plutella xylostella | g742.t1 |
| Spodoptera frugiperda (corn) | GSSPFG00014947001-PA |
| Spodoptera frugiperda (rice) | SFRICE015440-PA |
| Acyrthosiphon pisum | XP_003246127.1 XP_029344491.1 XP_029344492.1 |
| Aedes aegypti | AAEL008386-PC |
| Anopheles gambiae | AGAP006379-PA AGAP012156-PA |
| Apis mellifera | XP_397465.5 |
| Drosophila melanogaster | FBpp0304765 |
| Tribolium castaneum | XP_008199153.1 XP_015840355.1 |
| Homo sapiens | NP_001080.2 |
| Mus musculus | NP_001034670.1 NP_038883.2 XP_006524433.1 |