Summary: | Per- and polyfluorinated alkyl substances (PFAS), known for their widespread environmental presence and slow degradation, pose significant concerns. Of the approximately 10,000 known PFAS, only a few have undergone comprehensive testing, resulting in limited experimental data. In this study, we employed a combination of physics-based methods and data-driven models to address gaps in PFAS bioaccumulation potential. Using the COnductor-like Screening MOdel for Realistic Solvents (COSMO-RS) method, we predicted n-octanol/water partition coefficients (logKOW), crucial for PFAS bioaccumulation. Our developed Quantitative Structure-Property Relationship (QSPR) model exhibited high accuracy (R2 = 0.95, RMSEC = 0.75) and strong predictive ability (Q2LOO = 0.93, RMSECV = 0.83). Leveraging the extensive NORMAN, we predicted logKOW for over 4,000 compounds, identifying 244 outliers out of 4519. Further categorizing the database into eight Organisation for Economic Co-operation and Development (OECD) categories, we confirmed fluorine atoms role in enhanced bioaccumulation. Utilizing predicted logKOW, water solubility logSW, and vapor pressure logVP values, we calculated additional physicochemical properties that are responsible for the transport and dispersion of PFAS in the environment. Parameters such as Henry’s Law (kH), air–water partition coefficient (KAW), octanol–air coefficient (KOA), and soil adsorption coefficient (KOC) exhibited favorable correlations with literature data (R2 > 0.66). Our study successfully filled data gaps, contributing to the understanding of ubiquitous PFAS in the environment and estimating missing physicochemical data for these compounds.
|