Wolfram Language

Discover the Structural Complexity of Proteins

This example compares how the structural complexity of proteins varies with their molecular function. Structural complexity is measured by counting secondary structure features, such as helices and sheets. Information about proteins is available from the "Protein" entity type.

These are three sample proteins.

Gather the protein entities together with their secondary structure and molecular function.
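As a rough sketch of this step, assuming access to the Wolfram Knowledgebase, RandomEntity can draw sample entities and EntityProperties can list what the entity type offers. The names of the properties that hold secondary structure and molecular function are not stated here, so the sketch looks them up rather than guessing them.

```wolfram
(* Draw three sample proteins; the selection differs on every evaluation *)
sample = RandomEntity["Protein", 3]

(* List the available properties to locate the ones that hold
   secondary structure and molecular function *)
EntityProperties["Protein"]
```

Once the relevant property names are known, they can be passed to EntityValue to retrieve the values for the chosen proteins.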

For each protein, count the number of secondary structure features.
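The counting step can be sketched on stand-in data. The protein names and the shape of the data below (a list of feature -> residue span rules per protein) are assumptions made for illustration, not the actual format returned by the "Protein" entity type.

```wolfram
(* Hypothetical stand-in data: protein name -> list of feature -> residue span rules *)
structures = <|
  "proteinA" -> {"Helix" -> {3, 12}, "Helix" -> {20, 31}, "Sheet" -> {40, 45}},
  "proteinB" -> {"Sheet" -> {2, 8}},
  "proteinC" -> {}
|>;

(* Total number of secondary structure features per protein *)
featureCounts = Length /@ structures

(* Breakdown of the same features by type *)
featureTypes = (Counts[Keys[#]] &) /@ structures
```

A protein with no recorded features gets a count of 0 here; whether such proteins should be kept or dropped before further analysis is a separate choice.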

With the molecular functions and the feature count for each protein, one can group the feature counts by function. Because the number of features can vary widely between proteins, the median feature count is used to summarize each function.
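A minimal sketch of the grouping, again on hypothetical data: the keys "Name", "Functions" and "Count" and all values are invented for illustration. Because a protein can have several functions, each protein contributes its count once per function.

```wolfram
(* Hypothetical stand-in data: one association per protein *)
proteins = {
  <|"Name" -> "proteinA", "Functions" -> {"binding", "catalysis"}, "Count" -> 3|>,
  <|"Name" -> "proteinB", "Functions" -> {"binding"}, "Count" -> 11|>,
  <|"Name" -> "proteinC", "Functions" -> {"binding"}, "Count" -> 40|>,
  <|"Name" -> "proteinD", "Functions" -> {"catalysis"}, "Count" -> 7|>
};

(* One function -> count rule per (protein, function) pair *)
pairs = Flatten[Function[p, Thread[p["Functions"] -> p["Count"]]] /@ proteins];

(* Group the counts by function, then take the median of each group *)
medians = GroupBy[pairs, First -> Last, Median]
```

In this sample the "binding" counts are 3, 11 and 40, so the single large value does not pull the summary upward the way a mean would.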

Given the median number of features per function, a reverse lookup now finds, for each function, the proteins whose feature count equals that median.
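The reverse lookup can be sketched as follows. The {function, protein, feature count} triples are hypothetical stand-in data, and matching with Cases is one possible approach, not necessarily the one used in the original example.

```wolfram
(* Hypothetical stand-in data: {function, protein, feature count} triples *)
records = {
  {"binding", "proteinA", 3}, {"binding", "proteinB", 11}, {"binding", "proteinC", 40},
  {"catalysis", "proteinA", 3}, {"catalysis", "proteinD", 7}
};

(* Median feature count per function *)
medians = GroupBy[records, First -> Last, Median];

(* Reverse lookup: proteins whose count equals the median of their function *)
medianProteins = Association @ KeyValueMap[
  Function[{fn, med}, fn -> Cases[records, {fn, name_, med} :> name]],
  medians
]
```

A function with an even number of proteins can have a median that lies between two counts, in which case no protein matches and the lookup returns an empty list, as happens for "catalysis" in this sample.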

To show the variety of structural complexity, find a ribbon diagram for an example median protein for each function, where available.
