Waggawagga-CLI: A command-line tool for predicting stable single α-helices (SAH-domains), and the SAH-domain distribution across eukaryotes

@article{Simm2018WaggawaggaCLIAC,
  title={Waggawagga-CLI: A command-line tool for predicting stable single $\alpha$-helices (SAH-domains), and the SAH-domain distribution across eukaryotes},
  author={Dominic Simm and Martin Kollmar},
  journal={PLoS ONE},
  year={2018},
  volume={13}
}
Stable single-alpha helices (SAH-domains) function as rigid connectors and constant force springs between structural domains, and can provide contact surfaces for protein-protein and protein-RNA interactions. SAH-domains mainly consist of charged amino acids and are monomeric and stable in polar solutions, characteristics which distinguish them from coiled-coil domains and intrinsically disordered regions. Although the number of reported SAH-domains is steadily increasing, genome-wide analyses… 

Protein function prediction in genomes: Critical assessment of coiled-coil predictions based on protein structure data
TLDR
The most commonly used coiled-coil prediction tools are re-evaluated with respect to the most comprehensive reference data set available, the entire Protein Data Bank (PDB), down to each amino acid and its secondary structure.
Detection of single alpha-helices in large protein sequence sets using hardware acceleration.
Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2
TLDR
A large majority of IDR sequences in the proteomes of human and other eukaryotes would be expected to function in the absence of conditional folding, and up to 80% of IDRs in archaea and bacteria are predicted to conditionally fold, compared with less than 20% of eukaryotic IDRs.
Critical assessment of coiled-coil predictions based on protein structure data
TLDR
The most commonly used coiled-coil prediction tools are re-evaluated with respect to the most comprehensive reference data set available, the entire Protein Data Bank, down to each amino acid and its secondary structure; the results indicate that these predictions should be treated very cautiously and need to be supported and validated by experimental evidence.
Disentangling the complexity of low complexity proteins
TLDR
It is argued that statistical measures alone cannot capture all structural aspects of LCRs and recommend the combined usage of a variety of predictive tools and measurements.
Accelerating Charged Single α-helix Detection on FPGA
TLDR
A new architecture is proposed that can search for CSAHs 32 times faster than the authors' previous FPGA implementation, and the two design approaches are compared in terms of speed, implementation and accuracy.
A glutamine-based single α-helix scaffold to target globular proteins
TLDR
Rules are presented for designing peptides that fold into single α-helices by concatenating the glutamine side chain to main chain hydrogen bonds recently discovered in polyglutamine helices; the resulting peptides are uncharged, contain only natural amino acids, and can be optimized to interact with specific targets.

References

Distribution and evolution of stable single α-helices (SAH domains) in myosin motor proteins
TLDR
The largest available myosin sequence dataset, consisting of 7919 manually annotated myosin sequences from 938 species representing all major eukaryotic branches, is analysed using the SAH-prediction algorithm of Waggawagga, a recently developed tool for the identification of SAH-domains.
When a predicted coiled coil is really a single α-helix, in myosins and other proteins
TLDR
This review summarises recent findings on SAH domains, their properties, their potential functions and some clues on how to recognise them.
Charged single α‐helix: A versatile protein structural motif
TLDR
It is shown that these sequences represent a novel structural motif called “charged single α‐helix” (CSAH), which is based on sequence features characteristic for salt bridge stabilized single α‐helices, and possible functional roles of the corresponding segments are revealed.
Harnessing the Unique Structural Properties of Isolated α-Helices*
TLDR
The structure and function of SAH domains are reviewed, as well as the tools to identify them in natural proteins, with a discussion of recent studies that have successfully used the modular ER/K linker for engineering chimeric myosin proteins with altered mechanical properties.
Waggawagga: comparative visualization of coiled-coil predictions and detection of stable single α-helices (SAH domains)
TLDR
Waggawagga is a web-based tool for the comparative visualization of coiled-coil predictions and the detection of stable single α-helices (SAH domains); a window-based score has been developed to predict SAH domains.
Dynamic charge interactions create surprising rigidity in the ER/K α-helical protein motif
TLDR
The significant rigidity of the ER/K α-helix can help regulate protein function, as a force transducer between protein subdomains, making it a promising tool in designing synthetic proteins.
Characterization of long and stable de novo single alpha-helix domains provides novel insight into their stability
TLDR
Combining a PDB analysis with molecular modelling provides a rational explanation, demonstrating that Glu and Arg form salt bridges more commonly, utilize a wider range of rotamer conformations, and are more dynamic than Glu–Lys.
An exceptionally stable helix from the ribosomal protein L9: implications for protein folding and stability.
TLDR
Results show that a peptide corresponding to the central helix of L9 is monomeric in aqueous solution, >85% helical at 1 °C and 68 (±7)% helical at 25 °C, considerably more helical than any other protein fragment studied to date.
The Predicted Coiled-coil Domain of Myosin 10 Forms a Novel Elongated Domain That Lengthens the Head*
Myosin 10 contains a region of predicted coiled coil 120 residues long. However, the highly charged nature and pattern of charges in the proximal 36 residues appear incompatible with coiled-coil