India's Most Futuristic AI Conference Is Back – Bigger, Sharper, Bolder

Reading list

Overview of generative AI applications and their impact

Introduction to LangChain, ChatGPT and Gemini Pro

What are Large Language Models?GPT models Mistral Llama Gemini How to build diffferent LLM AppIications?

Introduction to Prompt Engineering Best Practices and Guidelines for Prompt Engineering N shot prompting Chain of Thought Tree of Thoughts Skeleton of Thoughts Chain of Emotion

Introduction to Finetuning LLMs Parameter-Efficient Finetuning (PEFT)LORA QLORA using Unsloth using Huggingface

What do you mean by Training LLMs from Scratch?

Intro to the LangChain Ecosystem Core Components of LangChain Applications of LCEL Chains RAG using LangChain LangGraph LangSmith

Introduction to RAG systems Evaluation of RAG systems

Getting Started with LlamaIndex Components of LlamaIndex Advanced approaches for powerful RAG system

Introduction to Stable Diffusion Generating image using Stable diffusion Diffusion models Prompt Engineering Concepts for Stable Diffusion MidJourney Understanding Dalle 3

Enhancing Code Quality with LangGraph Reflection

👁 Adarsh Balan

Adarsh Balan Last Updated : 17 Mar, 2025

6 min read

The LangGraph Reflection Framework is a type of agentic framework which offers a powerful way to improve language model outputs through an iterative critique process using Generative AI. This article breaks down how to implement a reflection agent that validates Python code using Pyright and improves its quality using GPT-4o mini. AI agents play a crucial role in this framework, automating decision-making processes by combining reasoning, reflection, and feedback mechanisms to enhance model performance.

Learning Objectives

Understand how the LangGraph Reflection Framework works.
Learn how to implement the framework to improve the quality of Python code.
Experience how well the framework works through a hands-on trial.

This article was published as a part of the Data Science Blogathon.

LangGraph Reflection Framework Architecture

The LangGraph Reflection Framework follows a simple yet effective agentic architecture:

Main Agent: Generates initial code based on the user’s request.
Critique Agent: Validates the generated code using Pyright.
Reflection Process: If errors are detected, the main agent is called again to refine the code until no issues remain.

👁 LangGraph Reflection framework architecture

Also Read: Agentic Frameworks for Generative AI Applications

How to Implement the LangGraph Reflection Framework

Here is a Step-by-Step Guide for an Illustrative Implementation and Usage:

Step 1: Environment Setup

First, install the required dependencies:

pip install langgraph-reflection langchain pyright

Step 2: Code Analysis with Pyright

We’ll use Pyright to analyze generated code and provide error details.

Pyright Analysis Function

from typing import TypedDict, Annotated, Literal
import json
import os
import subprocess
import tempfile

from langchain.chat_models import init_chat_model
from langgraph.graph import StateGraph, MessagesState, START, END
from langgraph_reflection import create_reflection_graph

os.environ["OPENAI_API_KEY"] = "your_openai_api_key"

def analyze_with_pyright(code_string: str) -> dict:
 """Analyze Python code using Pyright for static type checking and errors.

 Args:
 code_string: The Python code to analyze as a string

 Returns:
 dict: The Pyright analysis results
 """
 with tempfile.NamedTemporaryFile(suffix=".py", mode="w", delete=False) as temp:
 temp.write(code_string)
 temp_path = temp.name

 try:
 result = subprocess.run(
 [
 "pyright",
 "--outputjson",
 "--level",
 "error", # Only report errors, not warnings
 temp_path,
 ],
 capture_output=True,
 text=True,
 )

 try:
 return json.loads(result.stdout)
 except json.JSONDecodeError:
 return {
 "error": "Failed to parse Pyright output",
 "raw_output": result.stdout,
 }
 finally:
 os.unlink(temp_path)

Step 3: Main Assistant Model for Code Generation

GPT-4o Mini Model Setup

def call_model(state: dict) -> dict:
 """Process the user query with the GPT-4o mini model.

 Args:
 state: The current conversation state

 Returns:
 dict: Updated state with the model response
 """
 model = init_chat_model(model="gpt-4o-mini", openai_api_key = 'your_openai_api_key')
 return {"messages": model.invoke(state["messages"])}

Note: Use os.environ[“OPENAI_API_KEY”] = “YOUR_API_KEY” securely, and never hardcode the key in your code.

Step 4: Code Extraction and Validation

Code Extraction Types

# Define type classes for code extraction
class ExtractPythonCode(TypedDict):
 """Type class for extracting Python code. The python_code field is the code to be extracted."""
 python_code: str


class NoCode(TypedDict):
 """Type class for indicating no code was found."""
 no_code: bool

System Prompt for GPT-4o Mini

# System prompt for the model
SYSTEM_PROMPT = """The below conversation is you conversing with a user to write some python code. Your final response is the last message in the list.

Sometimes you will respond with code, othertimes with a question.

If there is code - extract it into a single python script using ExtractPythonCode.

If there is no code to extract - call NoCode."""

Pyright Code Validation Function

def try_running(state: dict) -> dict | None:
 """Attempt to run and analyze the extracted Python code.

 Args:
 state: The current conversation state

 Returns:
 dict | None: Updated state with analysis results if code was found
 """
 model = init_chat_model(model="gpt-4o-mini")
 extraction = model.bind_tools([ExtractPythonCode, NoCode])
 er = extraction.invoke(
 [{"role": "system", "content": SYSTEM_PROMPT}] + state["messages"]
 )
 if len(er.tool_calls) == 0:
 return None
 tc = er.tool_calls[0]
 if tc["name"] != "ExtractPythonCode":
 return None

 result = analyze_with_pyright(tc["args"]["python_code"])
 print(result)
 explanation = result["generalDiagnostics"]

 if result["summary"]["errorCount"]:
 return {
 "messages": [
 {
 "role": "user",
 "content": f"I ran pyright and found this: {explanation}\n\n"
 "Try to fix it. Make sure to regenerate the entire code snippet. "
 "If you are not sure what is wrong, or think there is a mistake, "
 "you can ask me a question rather than generating code",
 }
 ]
 }

Step 5: Creating the Reflection Graph

Building the Main and Judge Graphs

def create_graphs():
 """Create and configure the assistant and judge graphs."""
 # Define the main assistant graph
 assistant_graph = (
 StateGraph(MessagesState)
 .add_node(call_model)
 .add_edge(START, "call_model")
 .add_edge("call_model", END)
 .compile()
 )

 # Define the judge graph for code analysis
 judge_graph = (
 StateGraph(MessagesState)
 .add_node(try_running)
 .add_edge(START, "try_running")
 .add_edge("try_running", END)
 .compile()
 )

 # Create the complete reflection graph
 return create_reflection_graph(assistant_graph, judge_graph).compile()

reflection_app = create_graphs()

Step 6: Running the Application

Example Execution

if __name__ == "__main__":
 """Run an example query through the reflection system."""
 example_query = [
 {
 "role": "user",
 "content": "Write a LangGraph RAG app",
 }
 ]

 print("Running example with reflection using GPT-4o mini...")
 result = reflection_app.invoke({"messages": example_query})
 print("Result:", result)

Output Analysis

👁 enhanced Python code using LangGraph Reflection framework

👁 How LangGraph Reflection framework enhances Python code

What Happened in the Example?

Our LangGraph Reflection system was designed to do the following:

Take an initial code snippet.
Run Pyright (a static type checker for Python) to detect errors.
Use the GPT-4o mini model to analyze the errors, understand them, and generate improved code suggestions

Iteration 1 – Identified Errors

1. Import “faiss” could not be resolved.

Explanation: This error occurs when the faiss library isn’t installed or the Python environment doesn’t recognize the import.
Solution: The agent recommended running:

pip install faiss-cpu

2. Cannot access attribute “embed” for class “OpenAIEmbeddings”.

Explanation: The code referenced .embed, but in newer versions of langchain, embedding methods are .embed_documents() or .embed_query().
Solution: The agent correctly replaced .embed with .embed_query.

3. Arguments missing for parameters “docstore”, “index_to_docstore_id”.

Explanation: The FAISS vector store now requires a docstore object and an index_to_docstore_id mapping.
Solution: The agent added both parameters by creating an InMemoryDocstore and a dictionary mapping.

Iteration 2 – Progression

In the second iteration, the system improved the code but still identified:

1. Import “langchain.document” could not be resolved.

Explanation: The code attempted to import Document from the wrong module.
Solution: The agent updated the import to from langchain.docstore import Document.

2. “InMemoryDocstore” is not defined.

Explanation: The missing import for InMemoryDocstore was identified.
Solution: The agent correctly added:

from langchain.docstore import InMemoryDocstore

Iteration 3 – Final Solution

In the final iteration, the reflection agent successfully addressed all issues by:

Importing faiss correctly.
Switching .embed to .embed_query for embedding functions.
Adding a valid InMemoryDocstore for document management.
Creating a proper index_to_docstore_id mapping.
Correctly accessing document content using .page_content instead of treating documents as simple strings.

The improved code then successfully ran without errors.

Why This Matters

Automatic Error Detection: The LangGraph Reflection framework simplifies the debugging process by analyzing code errors using Pyright and generating actionable insights.
Iterative Improvement: The framework continuously refines the code until errors are resolved, mimicking how a developer might manually debug and improve their code.
Adaptive Learning: The system adapts to changing code structures, such as updated library syntax or version differences.

Conclusion

The LangGraph Reflection Framework demonstrates the power of combining AI critique agents with robust static analysis tools. This intelligent feedback loop enables faster code correction, improved coding practices, and better overall development efficiency. Whether for beginners or experienced developers, LangGraph Reflection offers a powerful tool for improving code quality.

Key Takeaways

By combining LangChain, Pyright, and GPT-4o mini within the LangGraph Reflection Framework, this solution provides an effective way to automatically validate code.
The framework helps LLMs generate improved solutions iteratively and also ensures higher-quality outputs through reflection and critique cycles.
This approach enhances the robustness of AI-generated code and improves performance in real-world scenarios.

The media shown in this article is not owned by Analytics Vidhya and is used at the Author’s discretion.

Frequently Asked Questions

Q1. What is LangGraph Reflection?

A. LangGraph Reflection is a powerful framework that combines a primary AI agent (for code generation or task execution) with a critique agent (to identify issues and suggest improvements). This iterative loop improves the final output by leveraging feedback and reflection.

Q2. How does the reflection mechanism work?

A. The reflection mechanism follows this workflow:
– Main Agent: Generates the initial output.
– Critique Agent: Analyzes the generated output for errors or improvements.
– Improvement Loop: If issues are found, the main agent is re-invoked with feedback for refinement. This loop continues until the output meets quality standards.

Q3. What libraries are required to use LangGraph Reflection?

A. You’ll need the following dependencies:
– langgraph-reflection
– langchain
– pyright (for code analysis)
– faiss (for vector search)
– openai (for GPT-based models)

To install them, run: pip install langgraph-reflection langchain pyright faiss openai

Q4. What types of tasks can LangGraph Reflection improve?

A. LangGraph Reflection excels at tasks like:
– Python code validation and improvement.
– Natural language responses requiring fact-checking.
– Document summarization with clarity and completeness.
– Ensuring AI-generated content adheres to safety guidelines.

Q5. Is LangGraph Reflection only for Python code correction?

A. No, while we have shown Pyright in code correction examples, the framework can even help in improving text summarization, data validation, and chatbot response refinement.

👁 Adarsh Balan

Adarsh Balan

Hi! I'm Adarsh, a Business Analytics graduate from ISB, currently deep into research and exploring new frontiers. I'm super passionate about data science, AI, and all the innovative ways they can transform industries. Whether it's building models, working on data pipelines, or diving into machine learning, I love experimenting with the latest tech. AI isn't just my interest, it's where I see the future heading, and I'm always excited to be a part of that journey!

AI Agents Generative AI Generative AI Application Intermediate