Apply the clustering method used in Dunham & Beltrao (2020) to a new deep mutational landscape dataset based on
a standardised deep_mutational_scan dataset.
recluster( x, keep_clustering = FALSE, deep_split = NULL, permissive = 0.4, add_combined = TRUE, cols = NULL, method = "average", ... )
| x |
|
|---|---|
| keep_clustering | logical. Keep the outputs of hclust and cutreeHybrid for downstream analysis |
| deep_split | Named vector of deepSplit parameters to pass to cutreeHybrid. Must be a numeric vector with a named entry for each amino acid or a single integer to apply to all amino acids |
| permissive | Absolute value threshold for considering a substitution permissive. Positions with all substitution scores below this threshold are assigned to the permissive subtype. |
| add_combined | Combine the supplied data with the deep_landscape dataset. |
| cols | Columns to cluster on, defaults to PC2:20 |
| method |
|
| ... | Additional arguments passed on to |
A tibble containing the clustered data.
This will be most valuable if x is a large multi_study dataset including quite a few new scans,
otherwise results will either be similar to the original dataset if add_combined = TRUE or based on too few
positions to be meaningful if not.
The default parameters apply the procedure used in the paper and a few parameters are provided to allow some adjustments or experimentation. Larger more novel changes will likely require adapting the code itself.