Cystic fibrosis is caused by biallelic pathogenic variants in the CFTR gene, which contains a polymorphic (TG)mTn sequence (the "poly-T/TG tract") in intron 9. While T9 and T7 alleles are benign, T5 alleles with longer TG repeats, e.g., (TG)12T5 and (TG)13T5, are clinically significant. Thus, professional medical societies currently recommend reporting the TG repeat size when T5 is detected. Sanger sequencing is a cost-effective method of genotyping the (TG)mTn tract; however, its polymorphic length substantially complicates data analysis. We developed CFTR-TIPS, a freely available web-based software tool that infers the (TG)mTn genotype from Sanger sequencing data. This tool detects the (TG)mTn tract in the chromatograms, quantifies goodness of fit with expected patterns, and visualizes the results in a graphical user interface. It is broadly compatible with any Sanger chromatogram that contains the (TG)mTn tract ± 15 bp. We evaluated CFTR-TIPS using 835 clinical samples previously analyzed in a CLIA-certified, CAP-accredited laboratory. When operated fully automatically, CFTR-TIPS achieved 99.8% concordance with our clinically validated manual workflow, while generally taking less than 10 s per sample. There were two discordant samples: one due to a co-occurring heterozygous duplication that confounded the tool and the other due to incomplete (TG)mTn tract detection in the reverse chromatogram. No clinically significant misclassifications were observed. CFTR-TIPS is a free, accurate, and rapid tool for CFTR (TG)mTn tract genotyping using cost-effective Sanger sequencing. This tool is suitable both for automated use and as an aid to manual review to enhance accuracy and reduce analysis time.
Keywords: CFTR; Sanger sequencing; cystic fibrosis; molecular diagnostics; poly-T tract.