Voozh

DeepKeep Launches Vibe AI Red Teaming: A New Approach to AI Security

DeepKeep is introducing Vibe AI Red Teaming, a new approach that combines human expertise with AI-driven execution.

The 45-Minute AI Lobotomy: Why Built-In Guardrails Are Dead

With open-source tools like Heretic performing a 45-minute lobotomy to effortlessly erase an AI's built-in safety guardrails, organizations must abandon the illusion that models can police themselves.

👁 Image

The AI Red Teaming Reality Check: How DeepKeep Delivers on OWASP

The OWASP v1.0 AI Red Teaming standard is the new benchmark for enterprise resilience. Read how DeepKeep ditches static jailbreaks for dynamic, context-aware testing across your entire agentic workflow.

👁 ControlNet rotten apple

A Rotten Apple Spoils the Image Generation

Poisoned training samples can turn ControlNet into a hidden backdoor. From a security perspective, this is not a noisy exploit. It is a sleeper agent waiting for the right signal.

👁 Image

Why LLM-as-a-Judge Isn't Enough

Let one AI keep an eye on another AI feels like putting a referee in the game. In reality, LLM-as-a-judge isn’t the silver bullet some people wish it was.

👁 Image

Multimodal AI is Smarter. Unfortunately, so are The Attacks.

AI has gotten good at understanding not just what we type, but what we show. This shift has made AI more powerful. Unfortunately, it has also made it more vulnerable.

👁 Image

You Can’t “Detect” a Jailbreak. Here’s What to Do Instead

Everyone is looking for an efficient way to detect and block jailbreaks, but here’s the uncomfortable truth: you can’t reliably detect every jailbreak, and trying to chase them all is a losing game.

👁 Image

Two Smart AI Models. Zero Common Sense.

AI is no longer a one-trick tool. It writes reports, analyzes photos, answers complex questions, and even kicks off real-world actions. Most of this power comes from two areas working side by side: Generative AI and Computer Vision.

👁 Image

Agentic AI Security: The Attack Surface Nobody Mapped Yet

AI agents don't just answer questions. They act. That means the blast radius of a security failure has expanded dramatically. Here's the attack surface most teams haven't mapped yet.

👁 Image

DeepKeep Selected as EIC Accelerator Winner: Europe Bets on AI Security

DeepKeep has been awarded €2.5M in blended finance through the EIC Accelerator's October 2024 cut-off. The co-funded project: Multimodal Models with AI-Native Security and Trustworthiness - a recognition that securing AI across LLMs, computer vision, spatial sensing, and multimodal systems isn't a nice-to-have. It's infrastructure.

👁 Image

Top Three Scenarios for PII Leakage in GenAI

Comprehensive PII detection combines scanning of data, penetration testing and a real-time AI firewall

👁 Image

What Is Prompt Injection? How It Works and How to Stop It

Prompt injection is the most exploited vulnerability in AI systems today, and one of the hardest to fully fix. Here's what it is, why it's structural, and how to build a defense that actually holds.

👁 Image

What is AI Red Teaming? A Practical Guide

Red teaming AI systems isn't the same as traditional pen testing. The attack surface is different, the methods are different, and a one-time exercise won't keep you safe. Here's what it actually involves.

👁 Image

DeepKeep Launches GenAI Risk Assessment Module

Evaluating model resilience is paramount, particularly during its inference phase in order to provide insights into the model's ability to handle various scenarios effectively

👁 Image

DeepKeep Comes out of Stealth to Safeguard GenAI with AI-Native Security and Trustworthiness

DeepKeep offers AI-Native security and trustworthiness that secures AI throughout its entire lifecycle

👁 Image

Meta’s LlamaV2 7B LLM Suffers from Susceptibility to DoS and Data Leakage

DeepKeep's evaluation of LlamaV2 7B's security and trustworthiness found strengths in task performance and ethical commitment, with areas for improvement in handling complex transformations, addressing bias, and enhancing security against sophisticated threats

View all

URL: https://www.deepkeep.ai/blog

⇱ AI Security Blog | DeepKeep

DeepKeep Launches Vibe AI Red Teaming: A New Approach to AI Security

The 45-Minute AI Lobotomy: Why Built-In Guardrails Are Dead

The AI Red Teaming Reality Check: How DeepKeep Delivers on OWASP

A Rotten Apple Spoils the Image Generation

Why LLM-as-a-Judge Isn't Enough

Multimodal AI is Smarter. Unfortunately, so are The Attacks.

You Can’t “Detect” a Jailbreak. Here’s What to Do Instead

Two Smart AI Models. Zero Common Sense.

Agentic AI Security: The Attack Surface Nobody Mapped Yet

DeepKeep Selected as EIC Accelerator Winner: Europe Bets on AI Security

Top Three Scenarios for PII Leakage in GenAI

What Is Prompt Injection? How It Works and How to Stop It

What is AI Red Teaming? A Practical Guide

DeepKeep Launches GenAI Risk Assessment Module

DeepKeep Comes out of Stealth to Safeguard GenAI with AI-Native Security and Trustworthiness

Meta’s LlamaV2 7B LLM Suffers from Susceptibility to DoS and Data Leakage