Putative uncharacterized protein C6orf52
Putative uncharacterized protein C6orf52 (C6orf52) is a protein in humans that is encoded by the gene "C6orf52" and has six known isoforms.[1] C6orf52 was identified in 2002 by The National Institutes of Health Mammalian Gene Collection (MGC) Program.[2] C6orf52 has one known paralog, tRNA selenocysteine 1-associated protein 1 (TRNAU1AP).[3]
Gene
[edit]The cytogenetic location of C6orf52 is 6p24.2 on the shorthand of chromosome 6.[4] It is 23,379 nucleotides long, spanning from nucleotide 10671418 to 10694797 and has a molecular weight of 17,383 Da with 9 different exons. C6orf52 has no common aliases although the major protein product is sometimes referred to as "Q5T4I8".[5]
mRNA
[edit]C6orf52 is known to undergo alternative splicing and has six known isoforms of varying length.
Proteins
[edit]Isoforms
[edit]Q5T4I8 has six known isoforms of varying amino acid length.
Isoform | Polypeptide Length |
---|---|
X1 | 207 |
X2 | 182 |
X3 | 177 |
X4 | 176 |
X5 | 126 |
X6 | 93 |
Composition
[edit]The protein composition is relatively high in glutamic acid and serine residue levels and is relatively low in tryptophan and arginine when compared to the average human protein composition.[6][7]
C6orf52 has two commonly predicted post-translational modifications present in the highly conserved domain.[8][9] The lysine at position 123 (of the major protein) within the highly conserved domain is expected to undergo sumoylation often, while the tyrosine at position 128 is expected to undergo phosphorylation. Sumoylation sites allow for the binding of SUMO (small ubiquitin-like modifier protein) which are known to alter different functional parameters of proteins such as subcellular localization, protein parenting, DNA binding and transactivation functions of transcription factors.[10] Tyrosine phosphorylation is associated with many things, namely growth factor signaling and cell differentiation during development which are recurring aspects of C6orf52.[11]
Structure
[edit]The secondary structure of C6orf52 consists mostly of coiled regions, however there is an extended alpha helix region within the highly conserved domain.[12][13]
Subcellular localization
[edit]It is predicted to be a non-transmembrane protein that is located within the nucleus.[14]
Expression
[edit]Tissue expression is highest within the oocyte, with high expression in the testes and female gonad.[5]
Expression is extremely high (2000-3000 transcripts per million) in the first stages of embryonic development up until the blastocyst.
Clinical Significance
[edit]Two proteins in cattle that have been linked to fat or energy metabolism were predicted to be similar to C6orf52, however there is no known clinical study done examining C6orf52.[15]
Homology
[edit]Paralogs
[edit]C6orf52 has one identified paralog, tRNA selenocysteine 1-associated protein 1 (TRNAU1AP), which is located on chromosome one at 1p35.3.[3] TRNAU1AP is involved selenocysteine biosynthesis, selenoproteins synthesis efficiency enhancement and may be involved in the methylation of tRNA(Sec).[16]
Orthologs
[edit]C6orf52 is conserved through many species. It can be found it many mammals, reptiles, and birds, such as the Zebra Finch.[17]
Scientific Name | Name | Accession | Sequence Similarity % | Date of Divergence (Estimated MYA)[18] |
---|---|---|---|---|
Sus scrofa | Wild Boar | NP_001138494.1 | 60.976 | 96 |
Taeniopygia guttata | Zebra Finch | XP_004175377.1 | 56.90 | 312 |
Ailuropoda melanoleuca | Giant Panda | XP_019651607.1 | 64.43 | 96 |
Pelodiscus sinensis | Chinese Softshell Turtle | XP_006138812.1 | 42.55 | 312 |
Canis lupus familiaris | Dog | XP_005640089.1 | 64.29 | 96 |
Pan troglodytes | Common Chimpanzee | XP_009448762.2 | 98.03 | 6.65 |
Macaca mulatta | Rhesus macaque | NP_001180810.1 | 94.08 | 29.44 |
There is a domain of high conservation across species starting near the last third of the polypeptide.
References
[edit]- ^ "NCBI(National Center for Biotechnology Information)". Retrieved 2019-02-22.
- ^ "NCBI(National Center for Biotechnology Information)". Virginia Medical. 105 (4): 272–277. April 1978. Retrieved 2019-02-25.
- ^ a b "NCBI(National Center for Biotechnology Information)". Retrieved 2019-02-25.
- ^ "HUGO Gene Symbol Report". Retrieved 2019-02-25.
- ^ a b "UniProt". Retrieved 2019-02-25.
- ^ "The Institute for Environmental Modeling (TIEM)". Retrieved 2019-05-05.
- ^ "ProtScale - Amino Acid Composition (in UniProKB/Swiss-Prot data bank)". Retrieved 2019-05-05.
- ^ "NetPhos 3.1 Server". Retrieved 2019-05-05.
- ^ "SUMOplot Analysis Program - Abgent". Retrieved 2019-05-05.
- ^ Hilgarth, Roland S.; Murphy, Lynea A.; Skaggs, Hollie S.; Wilkerson, Donald C.; Xing, Hongyan; Sarge, Kevin D. (2004). "JBC - Regulation and Function of SUMO Modification". Journal of Biological Chemistry. 279 (52): 53899–53902. doi:10.1074/jbc.R400021200. PMID 15448161.
- ^ Pasantes-Morales, H.; Franco, R. (2002). "Influence of protein tyrosine kinases on cell volume change-induced taurine release". Cerebellum (London, England). 1 (2): 103–9. doi:10.1080/147342202753671231. PMID 12882359. S2CID 9909209.
- ^ "NPS@: Network Protein Sequence Analysis - SOPMA". Retrieved 2019-05-05.
- ^ "I-TASSER server for protein structure and function prediction". Retrieved 2019-05-05.
- ^ "PSORT II Prediction". Retrieved 2019-05-05.
- ^ Chow, Y. W.; Pietranico, R.; Mukerji, A. (1975). "NCBI(National Center for Biotechnology Information)". Biochemical and Biophysical Research Communications. 66 (4): 1424–31. doi:10.1016/0006-291x(75)90518-5. PMID 6.
- ^ "UniProt - TRNAU1AP - tRNA selenosysteine 1-associated protein 1 - Homo sapiens". Retrieved 2019-05-06.
- ^ "BLAST NCBI". Retrieved 2019-02-25.
- ^ "TimeTree: The Timescale of Life". Retrieved 2019-02-25.