ECB-FEAT-23041431

From EchinoWiki
Jump to navigation Jump to search

Strongylocentrotus purpuratus Genome Assembly v5.0

The gene model for LOC594236 appears to be incorrectly annotated, with the current model actually being composed of two loci encoding separate proteins. One line of evidence that supports this assertion include the alignment of only approximately two-thirds of the protein encoded by the current gene model to its reciprocal best hit (RPH) in Lytechinus variegatus (LOC121431890), a closely related echinoid species, according to BLASTp. The Lytechinus variegatus RPH is encoded by an mRNA (XM_041629670.1) with a 5’-UTR with multiple truncated reading frames and stop codons based on the ExPASy translate tool (Gasteiger et al., 2003), so it likely encodes a complete protein sequence. Another line of evidence for a fused gene misannotation is the detection of two different signal peptides (amino acids 1-23 and 1921-1939) encoded by the current model in SignalP (Teufel et al., 2022), which suggests that the model likely encodes more than one protein.


LOC121431890 in Lytechinus Variegatus Only Aligning to Section of LOC594236 in Strongylocentrotus purpuratus in BLASTp
ExPASy Results for 5'-UTR of mRNA Encoded by LOC121431890 in Lytechinus variegatus
Incorrect Gene Model for LOC594236 for Strongylocentrotus purpuratus showing Two Signal Peptides at Amino Acids 1-23 (Left) and 1921-1939 (Right) in ExPASy