Downloads provided by UsageCounts
Datasets for "Deep indel mutagenesis reveals the impact of insertions and deletions on protein stability and function ". Code is available at: https://github.com/lehner-lab/deep_indel_mutagenesis. Amino acid insertions and deletions (indels) are an abundant class of genetic variants. However, compared to substitutions, the effects of indels on protein stability are not well understood and are poorly predicted. To better understand indels here we analyze new and existing large-scale deep indel mutagenesis (DIM) of structurally diverse proteins. The effects of indels on protein stability vary extensively among and within proteins and are not well predicted by existing computational methods. To address this shortcoming we present INDELi, a series of models that combine experimental or predicted substitution effects and secondary structure information to provide good prediction of the effects of indels on both protein stability and pathogenicity. Moreover, quantifying the effects of indels on protein-protein interactions suggests that insertions can be an important class of gain-of-function variants. Our results provide an overview of the impact of indels on proteins and a method to predict their effects genome-wide.additional_files.zipadditional_dfs.rdstsuboyama_sec_struc.rds DiMSum.zipDiMSum output in form of RData:aPCA_domains_fitness_replicates.RDataaPCA_domains_variant_data_merge.RDatagrb2_bind_fitness_replicates.RDatagrb2_fold_fitness_replicates.RDatapdz3_bind_fitness_replicates.RDatapdz3_fold_fitness_replicates.RDataInput files to reproduce the DiMSum run. Please donwload the raw fastq at Gene Expression Omnibus under theaccession number GSE244096.:aPCA_DiMSum_run.shaPCA_experimental_design_file.txtaPCA_synonymSequencePathaPCA_VariantIdentityaPCA_grb2sh3_DiMSum_run.shaPCA_grb2sh3_experimental_design_file.txtaPCA_grb2sh3_VariantIdentity.txtaPCA_psd95pdz3_DiMSum_run.shaPCA_psd95pdz3_experimental_design_file.txtaPCA_psd95pdz3_VariantIdentity.txtbPCA_grb2sh3_DiMSum_run.shbPCA_grb2sh3_experimental_design_file.txtbPCA_grb2sh3_VariantIdentity.txtbPCA_psd95pdz3_DiMSum_run.shbPCA_psd95pdz3_experimental_design_file.txtbPCA_psd95pdz3_VariantIdentity.txt pre_processed_data.zip color_scale.rdsscaled_variants_aPCA.rdsscaled_variants_bPCA.rdstsuboyama_nat_doms_all.rds indel_prediction_models.zip ddmut_prediction_mean.rdsGEMME_pred.rdsddG_ml_encoded.rdsddG_insertions_models.RddG_deletions_models.Rgenome_wide_prediction.zip ddG_ml_encoded.rdsClinVar_benchmarkINDELiE_predictions_human_proteome
| citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 15 | |
| downloads | 9 |

Views provided by UsageCounts
Downloads provided by UsageCounts