Jump to content

CXorf49

From Wikipedia, the free encyclopedia

CXorf49 is a protein, which in humans is encoded by the gene chromosome X open reading frame 49(CXorf49).

Gene

[edit]
The image shows the exact location of CXorf49 on the minus strand of the X chromosome.

The CXorf49 gene has one alias CXorf49B.[1] The recname A8MYA2 also refers to the protein coded by CXorf49 or CXorf49B.[2]

CXorf49 is located on the X chromosome at Xq13.1. It is 3912 base pairs long and the gene sequence has 6 exons.[3] CXorf49 has one protein coding transcript.[4]

Protein

[edit]

The protein has 514 amino acids and a molecular mass of 54.4 kDa.[5] The isoelectric point is 9.3. Compared to other human proteins CXorf49 is glycine- and proline-rich, but the protein has lower levels of asparagine, isoleucine, tyrosine and threonine(Statistical Analysis of Protein Sequences, SAPS [6]).

Domains

[edit]
Image of the protein with the domain of unknown function.

The domain of unknown function, DUF4641, is almost the entire protein. It is 433 amino acids long, from amino acid 80 until amino acid number 512.[7] DUF4641 is a part of pfam15483.[8] The domain is proline- and arginine-rich, but DUF4641 has lower levels of isoleucine, tyrosine and threonine compared to other proteins in human (Analysis of Protein Sequences, SAPS [6]). DUF4641 has an unusual spacing between lysine residues and positive charged amino acids (Analysis of Protein Sequences, SAPS [6]).

Post-translation modifications

[edit]

CXorf49 is predicted to have several post-translational sites. This include sites for N-acetyltransferase (NetAcet 1-[9]), glycation of ε amino groups of lysines (NetGlycate 1.0[10]), mucin type GalNAc O-glycosylation (NetOglyc 4.0[11]), phosphorylation (NetPhos 2.0[12]), sumoylation (SUMOplot Analysis Program[13]) and O-ß-GlcNAc attachment(YinOYang WWW[14]).

Subcellular localization

[edit]

The CXorf49 protein has been predicted to be located in the cell nucleus (PSORT II [15]).

Expression

[edit]

Promoter region

[edit]

The promoter region of CXorf49 is located between base pair 71718051 and 71718785 on the minus strand of the X chromosome and it is 735 bp long (Genomatix’s ElDorado program[16]). One of the most frequent transcription factor binding-sites in the promoter region are sites for Y-box binding factor.

Expression

[edit]

Though expression of CXorf49 is very low in human cells, is it somewhat higher in connective tissues, testis and uterus(NCBI-Unigene [17]).

Interactions

[edit]

The protein CXorf49 has not yet been shown to interact with other proteins (PSICQUIC[18]).

CXorf49 is found to be one of the components of a small group of the HL-60 cell proteome that were most prone to form 4-Hydroxy-2-nonenal(HNE) adducts, upon exposure to nontoxic (10 μM) HNE concentrations, along with heat shock 60 kDa protein 1.[19]

Homology

[edit]

Using BLAST[20] no orthologs for CXorf49 are found in single celled organisms, fungi or plants whose genomes have been sequenced. For multicellular organisms orthologs are found in mammals. The table below show a selection of the mammal orthologs. They are listed after time of divergence from human.

Genus and species name Common name Accession Number Sequence length Identity to human protein
Pan troglodytes Chimpanzee XP_001137982 514 aa 98 %
Callithrix jacchus Common marmoset XP_008987719 487 aa 65 %
Galeopterus variegatus Malayan flying lemur XP_008574823 525 aa 54 %
Tupaia chinensis Chinese tree shrew XP_006168003 527 aa 35 %
Chinchilla lanigera Long-tailed chinchilla XP_013358263 307 aa 49 %
Mus musculus House mouse NP_081944 513 aa 36 %
Canis lupus familiaris Dog XP_850392 526 aa 54 %
Odobenus rosmarus divergens Pacific walrus XP_012422579 530 aa 51 %
Mustela putorius furo Ferret XP_004777306 544 aa 50 %
Lipotes vexillifer Chinese river dolphin XP_007452050 529 aa 45 %
Ovis areis Sheep XP_004022229 536 aa 45 %
Capra hircus Goat XP_005700711 538 aa 44 %
Myotis lucifugus Little brown bat XP_006083036 500 aa 42 %
Myotis davidii David's myotis XP_006759573 495 aa 42 %
Bos taurus Cattle NP_001092664 534 aa 42 %
Equus asinus Asinus XP_014707878 723 aa 42 %
Trichechus manatus latirostris Florida manatee XP_012415455 505 aa 44 %
Dasypus novemcinctus Nine-banded armadillo XP_004475873 497 aa 44 %
Orycteropus afer afer Aardvark XP_007957133 477 aa 38 %

Phylogeny

[edit]

CXorf49 has developed from aardvarks, to the human protein over 105.0 million years.

This phylogenetic tree made with CRUSTALW on SDSC Biology Workbench[6] shows how CXorf49 in Human (Hsa), Chimpanzee(Ptro), Malayan flying lemur(Gava), Sheep (Ovari), Pacific walrus(Ord), Aardvark(Oafaf), Chinese tree shrew (Tuchi) and House mouse(Mmus) has diverged over time.

References

[edit]
  1. ^ "Homo sapiens chromosome X open reading frame 49 (CXorf49), mRNA - Nucleotide - NCBI". Ncbi.nlm.nih.gov. 2015-09-28. Retrieved 2016-04-28.
  2. ^ "RecName: Full=Uncharacterized protein CXorf49 - Protein - NCBI". Ncbi.nlm.nih.gov. 2015-09-28. Retrieved 2016-04-28.
  3. ^ "CXorf49 chromosome X open reading frame 49 [Homo sapiens (human)] - Gene - NCBI". Ncbi.nlm.nih.gov. Retrieved 2016-04-28.
  4. ^ "Gene & protein Summary: cxorf49". Ebi.ac.uk. Retrieved 2016-04-28.
  5. ^ "CXorf49 Gene(Protein Coding) Chromosome X Open Reading Frame 49". GeneCards. Retrieved 2016-04-28.
  6. ^ a b c d "SDSC Biology Workbench". seqtool.sdsc.edu. Archived from the original on 2003-08-11. Retrieved 2016-05-06.
  7. ^ "uncharacterized protein CXorf49 [Homo sapiens] - Protein - NCBI". Ncbi.nlm.nih.gov. 2015-09-28. Retrieved 2016-04-28.
  8. ^ "NCBI CDD Conserved Protein Domain DUF4641". www.ncbi.nlm.nih.gov. Retrieved 2016-05-06.
  9. ^ "NetAcet 1.0 Server". Cbs.dtu.dk. Retrieved 2016-04-28.
  10. ^ "NetGlycate 1.0 Server". Cbs.dtu.dk. Retrieved 2016-04-28.
  11. ^ "NetOGlyc 4.0 Server". Cbs.dtu.dk. 2013-05-15. Retrieved 2016-04-28.
  12. ^ "NetPhos 2.0 Server". Cbs.dtu.dk. Retrieved 2016-04-28.
  13. ^ "SUMOplot Analysis Program". Abgent. Retrieved 2016-04-28.
  14. ^ "YinOYang 1.2 Server". Cbs.dtu.dk. Retrieved 2016-04-28.
  15. ^ http://psort.hgc.jp/cgi-bin/runpsort.pl[permanent dead link]
  16. ^ "Genomatix's ElDorado". Archived from the original on 2021-04-03. Retrieved 2016-05-06.
  17. ^ "EST Profile - Hs.632817". Ncbi.nlm.nih.gov. Retrieved 2016-04-28.
  18. ^ "PSIQUIC". Archived from the original on 2014-12-17.
  19. ^ Arcaro, Alessia; Daga, Martina; Cetrangolo, Giovanni Paolo; Ciamporcero, Eric Stefano; Lepore, Alessio; Pizzimenti, Stefania; Petrella, Claudia; Graf, Maria; Uchida, Koji; Mamone, Gianfranco; Ferranti, Pasquale; Ames, Paul R. J.; Palumbo, Giuseppe; Barrera, Giuseppina; Gentile, Fabrizio (2015). "Generation of Adducts of 4-Hydroxy-2-nonenal with Heat Shock 60 kDa Protein 1 in Human Promyelocytic HL-60 and Monocytic THP-1 Cell Lines". Oxidative Medicine and Cellular Longevity. 2015: 296146. doi:10.1155/2015/296146. PMC 4452872. PMID 26078803.
  20. ^ Protein BLAST