3 public repositories matching this topic:
VISION is a framework for robust and interpretable code vulnerability detection using counterfactual data augmentation. It leverages GNNs, LLM-generated counterfactuals, and graph-based explainability to mitigate spurious correlations and improve generalization on real-world vulnerabilities (CWE-20).
A Framework for Robust, Self-Recovering Tool-Using Language Model Agents — trained on 50K+ failure-annotated trajectories for fault-tolerant reasoning and recovery.
Investigating the "Gradient Noise Paradox" in AI Safety: a study of the conflict between Differential Privacy (DP-SGD) and Adversarial Training. Uses a custom "Shadow Model" pipeline to synchronize Opacus with PGD attacks, demonstrating how privacy-preserving noise systematically degrades model robustness.
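The PGD side of such a pipeline can be sketched without any deep-learning framework: for a binary logistic-regression classifier the loss gradient with respect to the input is closed-form, so an L∞ PGD attack reduces to iterated sign-gradient steps followed by projection back onto the ε-ball. This is a minimal illustrative sketch, not code from the repository; all names and parameter values here are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def pgd_attack(x, y, w, b, eps=0.3, alpha=0.05, steps=20):
    """L-infinity PGD on binary logistic regression.

    Loss L = -log p(y | x); for y in {0, 1}, dL/dx = (sigmoid(w.x + b) - y) * w.
    Each step moves x along sign(grad) to increase the loss, then projects
    back into the eps-ball around the original input.
    """
    x0 = x.copy()
    x_adv = x.copy()
    for _ in range(steps):
        grad = (sigmoid(w @ x_adv + b) - y) * w
        x_adv = x_adv + alpha * np.sign(grad)        # gradient-ascent step on the loss
        x_adv = np.clip(x_adv, x0 - eps, x0 + eps)   # project onto the eps-ball
    return x_adv

# Toy check: the attack should raise the loss relative to the clean input.
w = np.array([1.0, -2.0])
b = 0.0
x = np.array([0.5, -0.5])
y = 1.0

def loss(x_in):
    return -np.log(sigmoid(w @ x_in + b))

x_adv = pgd_attack(x, y, w, b)
assert loss(x_adv) > loss(x)                  # adversarial loss strictly higher
assert np.all(np.abs(x_adv - x) <= 0.3 + 1e-9)  # perturbation stays in the eps-ball
```

In a DP-SGD setting the same attack loop would run inside training, but the defender's gradients are then clipped and noised (e.g. by Opacus), which is exactly the tension the repository description refers to.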