Jump to content

英文维基 | 中文维基 | 日文维基 | 草榴社区

Draft:Chromosome 17 open reading frame 107

From Wikipedia, the free encyclopedia

Chromosome 17 Open Reading Frame 107 is a protein encoded by the C17orf107 gene in humans.

An Error has occurred retrieving Wikidata item for infobox

Gene

[edit]

Location

[edit]

Human C17orf107 is located on the short arm of chromosome 17 at p 13.2. With introns, the gene spans nucleotides 4,899,536 to 4,906,715. The spliced gene is 3,201 nucleotides with three exons on the plus strand.[1]

Gene Level Regulation

[edit]

Human C17orf107 is expressed ubiquitously in tissues with the heart and brain tissues having a higher expression and the endocrine and salivary gland tissues having the highest expression.[1][2]

Protein

[edit]
iTasser Secondary Structure of human C17orf107 protein with helixes in magenta and coils in blue

Human C17orf107 protein is 190 amino acids with a molecular mass of approximately 20 kDa and a basal isoelectric point of 7 pH.[3] C17orf107 is a part of the DUFF5536 conserved protein domain which is part of the pfam17688, a member of the superfamily cl39220.[4]

Protein Level Regulation

[edit]

Human C17orf107 protein is localized intracellularly with evidence of being expressed in the nucleoplasm.[2] The protein has no asparagines, therefore there are no N-glycosylation sites.[5] C17orf107 protein does not have post-translational modifications, disulfide bonds, or signal peptides.[6]

Evolution

[edit]

Orthologs

[edit]

Human C17orf107 is found in all mammals except Monotremes, Hyracoidea, Tubulidentata, Cingulata, Peramelemorphia, Paucituberculata, Sirenia, and Notoryctemorphia groups.[7]

Orthologs
Taxonomic Group Common Name Genus and Species Accession Number Date of Divergence (MYA) Sequence Length (aa) Sequence Identity (%) Sequence Similarity (%) Sequence Gaps (%)
Primates Human Homo sapiens NP_001139008.1[8] 190
Western Lowland Gorilla Gorilla gorilla gorilla XP_018868465.1[9] 9 189 82 83 15
Lagomorpha European Rabbit Oryctolagus cuniculus XP_008269053.1[10] 87 204 64 72 13
Rodentia House Mouse Mus musculus EDL12584.1[11] 87 128 33 37 55
Artiodactyla East African Hippopotamus Hippopotamus amphibius kiboko XP_057569928.1[12] 94 183 78 84 4
Sheep Ovis aries KAG5203555.1[13] 94 233 51 55 34
Wild Bactrian Camel Camelus ferus EPY72335.1[14] 94 183 43 50 32
Carnivora Sea Otter Enhydra lutris kenyoni XP_022380488.1[15] 94 216 63 69 19
Giant Panda Ailuropoda melanoleuca XP_034502549.1[16] 94 247 58 63 25
Clouded Leopard Neofelis nebulosa XP_058561310.1[17] 94 90 40 44 53
Cetacea Harbor Porpoise Phocoena phocoena XP_065753064.1[18] 94 179 70 79 11
Narwhal Monodon monoceros TKC42450.1[19] 94 263 26 30 51
Chiroptera Nathusius Pipistrelle Bat Pipistrellus nathusii CAK6437783.1[20] 94 135 39 45 42
Horse Equus caballus XP_005597777.1[21] 94 212 69 75 16
Perissodactyla Southern White Rhinoceros Diceros bicornis minor XP_058415933.1[22] 94 218 67 75 13
Pholidota Sunda Pangolin Manis javanica XP_017503787.1[23] 94 197 65 73 8
Proboscidea Indian Elephant Elephas maximus indicus XP_049717565.1[24] 99 248 56 62 26
Marsupial Agile Gracile Opossom Gracilinanus agilis XP_044529018.1[25] 160 179 50 52 22
Tasmanian Devil Sarcophilus harrisii XP_012404229.1[26] 160 189 48 59 22
Monito Del Monte Dromiciops gliroides XP_043856787.1[27] 160 198 41 52 20

Divergence

[edit]
Human C17orf107 protein divergence evolution compared to Fibrinogen alpha and Cytochrome C in the orthologs House Mouse, Tasmanian Devil, Nathusius Pipstrelle Bat, Sheep, Monito Del Monte, and Sea Otter. The blue triangles represent Fibrinogen Alpha, the green circles represent Cytochrome C, and the orange squares represent C17orf107, with trendlines in correlating colors.

The C17orf107 gene first appeared in mammals with the earliest found species dating back to 160 million years ago in marsupials.[28] The gene evolved at a similar rate to Fibrinogen alpha with a difference of 0.1205 R2 values. Both Fibrinogen alpha and C17orf107 evolve at a quicker rate than Cytochrome C.[7]

References

[edit]
  1. ^ a b "C17orf107 chromosome 17 open reading frame 107". National Library of Medicine. Retrieved 4 December 2024.
  2. ^ a b "C17orf107". The Human Protein Atlas. Retrieved 11 December 2024.
  3. ^ "Uncharacterized protein C17orf107". PhosphoSitePlus. Cell Signaling Technology. Retrieved 4 December 2024.
  4. ^ "Conserved Protein Domain Family DUF5536". NCBI. Retrieved 11 December 2024.
  5. ^ "NetNGlyc". DTU Health Tech. Retrieved 11 December 2024.
  6. ^ "Protter". ETHZürich. Retrieved 11 December 2024.
  7. ^ a b "NCBI Blast". National Library of Medicine. Retrieved 11 December 2024.
  8. ^ "uncharacterized protein C17orf107 [Homo sapiens]". National Library of Medicine. Retrieved 11 December 2024.
  9. ^ "uncharacterized protein C17orf107 homolog [Gorilla gorilla gorilla]". National Library of Medicine. Retrieved 11 December 2024.
  10. ^ "uncharacterized protein C17orf107 homolog [Oryctolagus cuniculus]". National Library of Medicine. Retrieved 11 December 2024.
  11. ^ "mCG21181, isoform CRA_a, partial [Mus musculus]". National Library of Medicine. Retrieved 11 December 2024.
  12. ^ "uncharacterized protein C17orf107 homolog [Hippopotamus amphibius kiboko]". National Library of Medicine. Retrieved 11 December 2024.
  13. ^ "hypothetical protein JEQ12_003138 [Ovis aries]". National Library of Medicine. Retrieved 11 December 2024.
  14. ^ "hypothetical protein CB1_083302001 [Camelus ferus]". National Library of Medicine. Retrieved 11 December 2024.
  15. ^ "uncharacterized protein C17orf107 homolog [Enhydra lutris kenyoni]". National Library of Medicine. Retrieved 11 December 2024.
  16. ^ "uncharacterized protein C17orf107 homolog [Ailuropoda melanoleuca]". National Library of Medicine. Retrieved 11 December 2024.
  17. ^ "uncharacterized protein C17orf107 homolog [Neofelis nebulosa]". National Library of Medicine. Retrieved 11 December 2024.
  18. ^ "uncharacterized protein C17orf107 homolog, partial [Phocoena phocoena]". National Library of Medicine. Retrieved 11 December 2024.
  19. ^ "hypothetical protein EI555_006246, partial [Monodon monoceros]". National Library of Medicine. Retrieved 11 December 2024.
  20. ^ "unnamed protein product [Pipistrellus nathusii]". National Library of Medicine. Retrieved 11 December 2024.
  21. ^ "uncharacterized protein C17orf107 homolog [Equus caballus]". National Library of Medicine. Retrieved 11 December 2024.
  22. ^ "uncharacterized protein C17orf107 homolog isoform X1 [Diceros bicornis minor]". National Library of Medicine. Retrieved 11 December 2024.
  23. ^ "LOW QUALITY PROTEIN: uncharacterized protein C17orf107 homolog [Manis javanica]". National Library of Medicine. Retrieved 11 December 2024.
  24. ^ "uncharacterized protein C17orf107 homolog [Elephas maximus indicus]". National Library of Medicine. Retrieved 11 December 2024.
  25. ^ "uncharacterized protein C17orf107 homolog [Gracilinanus agilis]". National Library of Medicine. Retrieved 11 December 2024.
  26. ^ "uncharacterized protein C17orf107 homolog [Sarcophilus harrisii]". National Library of Medicine. Retrieved 11 December 2024.
  27. ^ "LOW QUALITY PROTEIN: uncharacterized protein C17orf107 homolog [Dromiciops gliroides]". National Library of Medicine. Retrieved 11 December 2024.
  28. ^ "Divergence Time". TimeTree5. Retrieved 11 December 2024.