Palantir: a springboard for the analysis of secondary metabolite gene clusters in large-scale genome mining projects

Loïc Meunier; Pierre Tocquin; Luc Cornet; Damien Sirjacobs; Valérie Leclère; Maude Pupin; Philippe Jacques; Denis Baurain

doi:10.1093/bioinformatics/btaa517

Palantir: a springboard for the analysis of secondary metabolite gene clusters in large-scale genome mining projects

Bioinformatics. 2020 Aug 1;36(15):4345-4347. doi: 10.1093/bioinformatics/btaa517.

Authors

Loïc Meunier^{1

2}, Pierre Tocquin^{3

4}, Luc Cornet⁵, Damien Sirjacobs¹, Valérie Leclère⁶, Maude Pupin^{7

8}, Philippe Jacques², Denis Baurain^{1

4}

Affiliations

¹ InBioS-PhytoSYSTEMS, Eukaryotic Phylogenomics, University of Liège, B-4000 Liège, Belgium.
² Microbial Processes and Interactions, TERRA Teaching and Research Centre, Joint Research Unit BioEcoAgro UMRT 1158, Gembloux Agro-Bio Tech, University of Liège, B-5030 Gembloux, Belgium.
³ InBioS-PhytoSYSTEMS, Plant Physiology, University of Liège, B-4000 Liège, Belgium.
⁴ Hedera-22 SCRL, B-4130 Tilff, Belgium.
⁵ GIGA institute, Medical Genomics-Unit of Animal Genomics, University of Liège, B-4000 Liège, Belgium.
⁶ Univ. Lille, INRA, ISA, Univ. Artois, Univ. Littoral Côte d'Opale, EA 7394-ICV-Institut Charles Viollette, Joint Research Unit BioEcoAgro UMRT 1158, F-59000 Lille, France.
⁷ UMR 9189- CRIStAL- Centre de Recherche en Informatique Signal et Automatique de Lille, University of Lille, CNRS, Centrale Lille, F-59000 Lille, France.
⁸ Bonsai Team, Inria-Lille Nord Europe, F-59655 Villeneuve d'Ascq Cedex, France.

PMID: 32415965
DOI: 10.1093/bioinformatics/btaa517

Abstract

Summary: To support small and large-scale genome mining projects, we present Post-processing Analysis tooLbox for ANTIsmash Reports (Palantir), a dedicated software suite for handling and refining secondary metabolite biosynthetic gene cluster (BGC) data annotated with the popular antiSMASH pipeline. Palantir provides new functionalities building on NRPS/PKS predictions from antiSMASH, such as improved BGC annotation, module delineation and easy access to sub-sequences at different levels (cluster, gene, module and domain). Moreover, it can parse user-provided antiSMASH reports and reformat them for direct use or storage in a relational database.

Availability and implementation: Palantir is released both as a Perl API available on CPAN (https://metacpan.org/release/Bio-Palantir) and as a web application (http://palantir.uliege.be). As a practical use case, the web interface also features a database built from the mining of 1616 cyanobacterial genomes, of which 1488 were predicted to encode at least one BGC.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Bacteria / genetics
Biosynthetic Pathways*
Molecular Sequence Annotation
Multigene Family
Software*