VOOZH about

URL: https://scite.ai/reports/are-bigger-data-sets-better-nE4Nn6

⇱ Are Bigger Data Sets Better for Machine Learning? Fusing Single-Point and Dual-Event Dose Response Data for <i>Mycobacterium tuberculosis</i>


Skip to main content
2014
DOI: 10.1021/ci500264r |Get access via publisher |Summarize |
|
Sign up to set email alerts

Are Bigger Data Sets Better for Machine Learning? Fusing Single-Point and Dual-Event Dose Response Data for Mycobacterium tuberculosis

Abstract: Tuberculosis is a major neglected disease for which the quest to find new treatments continues. There is an abundance of data from large phenotypic screens in the public domain against Mycobacterium tuberculosis (Mtb). Since machine learning methods can learn from past data, we were interested in addressing whether more data builds better models. We now describe using Bayesian machine learning to assess whether we can improve our models by combining the large quantities of single-point data with the much small… Show more

Search citation statements

Order By: Relevance

Paper Sections

Select...
30
15
7
2

Citation Types

3
45
0
0

Year Published

2014
20142026
2026

Publication Types

Select...
39
4
3

Relationship

25
21

Authors

Journals

👁 citations

Cited by 43 publications

(48 citation statements)
👁 references

References 60 publications

3
45
0
0
Order By: Relevance
“…The high-throughput docking/Bayesian workflow was, thus, able to find a small molecule (JSF-2164) with promising InhA inhibition, whole-cell activity, and modest SI versus Vero cells for a virtual screening hit. The hit rate of 2/19 (10.5%) compounds with acceptable antitubercular efficacy is similar to hit rates we have witnessed when implementing Bayesian models in previous ligand-based screens, which are more cost- and time-efficient than experimental HTS against M. tuberculosis (hit rates <1%). Additionally, the Bayesian approach removed from consideration whole-cell inactive compounds, as the TAACF-CB2 model’s lowest scoring five candidates all exhibited MIC values between 125 and 250 μM.…”
Section: Discussionsupporting
confidence: 71%
Exaggerated anticipatory anxiety is common in social anxiety disorder (SAD). Neuroimaging studies have revealed altered neural activity in response to social stimuli in SAD, but fewer studies have examined neural activity during anticipation of feared social stimuli in SAD. The current study examined the time course and magnitude of activity in threat processing brain regions during speech anticipation in socially anxious individuals and healthy controls (HC). Method Participants (SAD n = 58; HC n = 16) underwent functional magnetic resonance imaging (fMRI) during which they completed a 90s control anticipation task and 90s speech anticipation task.
“…The high-throughput docking/Bayesian workflow was, thus, able to find a small molecule (JSF-2164) with promising InhA inhibition, whole-cell activity, and modest SI versus Vero cells for a virtual screening hit. The hit rate of 2/19 (10.5%) compounds with acceptable antitubercular efficacy is similar to hit rates we have witnessed when implementing Bayesian models in previous ligand-based screens, which are more cost- and time-efficient than experimental HTS against M. tuberculosis (hit rates <1%). Additionally, the Bayesian approach removed from consideration whole-cell inactive compounds, as the TAACF-CB2 model’s lowest scoring five candidates all exhibited MIC values between 125 and 250 μM.…”
Section: Discussionsupporting
confidence: 71%
Exaggerated anticipatory anxiety is common in social anxiety disorder (SAD). Neuroimaging studies have revealed altered neural activity in response to social stimuli in SAD, but fewer studies have examined neural activity during anticipation of feared social stimuli in SAD. The current study examined the time course and magnitude of activity in threat processing brain regions during speech anticipation in socially anxious individuals and healthy controls (HC). Method Participants (SAD n = 58; HC n = 16) underwent functional magnetic resonance imaging (fMRI) during which they completed a 90s control anticipation task and 90s speech anticipation task.
“…We found that the larger Mtb model slightly edged out the smaller ChEMBL data set, but the statistics were nearly identical. In a previous study we saw a similar plateauing of model ROC when assessing different dataset sizes 18 .…”
Section: Discussionsupporting
confidence: 67%
“…In a previous study, we saw a similar plateauing of model ROC when assessing different data set sizes. 18 One method to expand the usefulness of these larger models using Bayesian models would be to utilize a nearest neighbors approach. If a model weighted the final prediction score based on both the Bayesian score and the kNN score (weighted by the model size), this may yield more accurate predictions.…”
Section: Molecular Pharmaceuticssupporting
confidence: 76%
“…Machine learning has been similarly used in many areas of biomedical and environmental research 8388 . This study is a natural extension of our prior machine learning studies applied to Mtb 911, 1318 which demonstrated an enhancement in hit rate. To further leverage public data to enhance the performance of our models, we have now curated a large, chemically diverse Mtb dataset through careful analysis of in vitro data mined from various sources, such as the primary literature and ChEMBL, and also included in vivo data.…”
Section: Discussionmentioning
confidence: 76%
See 1 more Smart Citation
Exaggerated anticipatory anxiety is common in social anxiety disorder (SAD). Neuroimaging studies have revealed altered neural activity in response to social stimuli in SAD, but fewer studies have examined neural activity during anticipation of feared social stimuli in SAD. The current study examined the time course and magnitude of activity in threat processing brain regions during speech anticipation in socially anxious individuals and healthy controls (HC). Method Participants (SAD n = 58; HC n = 16) underwent functional magnetic resonance imaging (fMRI) during which they completed a 90s control anticipation task and 90s speech anticipation task.
“…The ROC values for the M. tuberculosis models are comparable to those published recently using a commercial tool. For example, in this study MLSMR single-point model three-fold ROC = 0.87 (Figure S1, five-fold ROC 0.87, leave out 50% × 100 cross-validation ROC = 0.86) and MLSMR dose–response model three-fold cross-validation ROC = 0.75 (leave out 50% × 100 cross-validation ROC = 0.73), M. tuberculosis efficacy in mouse three-fold ROC = 0.73 (five-fold ROC = 0.73), and Ames mutagenicity three-fold ROC = 0.83 (five-fold ROC = 0.84).…”
Section: Resultsmentioning
confidence: 71%
Exaggerated anticipatory anxiety is common in social anxiety disorder (SAD). Neuroimaging studies have revealed altered neural activity in response to social stimuli in SAD, but fewer studies have examined neural activity during anticipation of feared social stimuli in SAD. The current study examined the time course and magnitude of activity in threat processing brain regions during speech anticipation in socially anxious individuals and healthy controls (HC). Method Participants (SAD n = 58; HC n = 16) underwent functional magnetic resonance imaging (fMRI) during which they completed a 90s control anticipation task and 90s speech anticipation task.
👁 scite logo

Scite is an AI-powered platform that helps researchers discover and evaluate scientific literature through Smart Citations, showing whether studies support or contradict a claim. Now part of Research Solutions, Scite has indexed 1.6B+ citations, partners with 30+ publishers, and serves 2M users worldwide.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Copyright © 2026 Scite LLC. All rights reserved.

Made with 💙 for researchers

Part of the Research Solutions Family.

👁 Research Solutions Logo