ECB-FEAT-23083531

From EchinoWiki
Jump to navigation Jump to search

Strongylocentrotus purpuratus Genome Assembly v5.0

The gene model for LOC756959 appears to be incorrectly annotated, with the upstream locus, LOC115918705, likely being part of the gene. Lines of evidence that support this assertion include the absence of 5’-UTR within LOC756959 mRNA transcript, XM_030987442.1, and the adjacency of the coding loci on the same reading frame. Furthermore, proteins encoded by each gene models individually aligns to different regions of the protein encoded by clstn1 in mammals when they are input into BLAST. However, the overall gene model for LOC756959 and LOC115918705 likely remains incomplete, for neither of these loci are associated mRNA transcripts with 5’-UTRs. The gene model also does not encode for proteins with signal peptides when analyzed using SMART (Letunic et al., 2006) or SignalP 6.0 (Teufel et al., 2022).

Incomplete Model (LOC756959) for Strongylocentrotus purpuratus in SMART
Incomplete Model (LOC115918705 and LOC756959) for Strongylocentrotus purpuratus in SMART
BLASTp Alignment Comparing Protein Encoded by LOC115918705 from Strongylocentrotus purpuratus (top) to CSTN1 in Mus musculus (bottom)
Blastp Alignment Comparing Protein Encoded by LOC756959 from Strongylocentrotus purpuratus (top) to CSTN1 in Mus musculus (bottom)
Incomplete Model (LOC756959) for Strongylocentrotus purpuratus in SignalP 6.0
Incomplete Model (LOC115918705 and LOC756959) for Strongylocentrotus purpuratus in SignalP 6.0