
pmid: 41562299
pmc: PMC12820801
Abstract FAALs (fatty acyl‐AMP ligases) recruit and incorporate fatty acids during the biosynthesis of secondary metabolites. Their diversity, distribution, and substrate specificity remain poorly understood, which limits functional predictions from sequence data. In this study, we explored the prevalence and diversity of FAAL enzymes across the tree of life and show that these are widely distributed in secondary metabolite‐rich bacteria and predominantly in the context of polyketide and non‐ribosomal peptide biosynthetic pathways. FAALs were also found to be present in certain archaea and eukaryotic groups. The phylogenetic placement of FAALs was not correlated to the chain length of the fatty acids that they activate and load. Therefore, we developed a bioinformatics and AI workflow (FAALPred) to predict the chain length of the fatty‐acid substrate of a given FAAL sequence. The robustness and accuracy of the predictions generated by FAALPred were validated using independent in vitro and in silico data. We anticipate that FAALPred will not only accelerate secondary metabolite structural predictions and subsequent discovery from FAAL‐associated pathways but also facilitate engineering of lipoylation. FAALPred is available at https://faalpred.ciimar.up.pt/ .
Ligases, Bacteria, Bacterial Proteins, Fatty Acids, Computational Biology, Phylogeny, Research Article, Substrate Specificity
Ligases, Bacteria, Bacterial Proteins, Fatty Acids, Computational Biology, Phylogeny, Research Article, Substrate Specificity
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
