VOOZH about

URL: https://www.analyticsvidhya.com/blog/2023/12/apple-secretly-launches-its-first-open-source-llm-ferret/

⇱ Apple Ferret: Apple's First Open-Source LLM, Secretly Launched


India's Most Futuristic AI Conference Is Back – Bigger, Sharper, Bolder

  • d
  • :
  • h
  • :
  • m
  • :
  • s

Reading list

Apple Secretly Launches Its First Open-Source LLM, Ferret

K.C. Sabreena Basheer Last Updated : 30 Dec, 2023
2 min read

Apple has quietly introduced Ferret, its first open-source multimodal large language model (LLM), marking a significant departure from its traditional secretive approach. Developed in collaboration with Columbia University, Ferret integrates language understanding with image analysis, promising groundbreaking applications in various fields. This strategic move reflects Apple’s commitment to stay at the forefront of the rapidly evolving multimodal AI landscape.

Unveiling Ferret

In collaboration with Columbia University, Apple released Ferret, an open-source multimodal LLM, without the fanfare typically associated with such breakthroughs. Unlike its closed-door strategy, this move emphasizes Apple’s dedication to openness and potential collaboration in the AI community.

Also Read: BharatGPT: Reliance Jio’s AI Collaboration with IIT Bombay

Ferret’s Technical Marvel

Ferret, powered by 8 Nvidia A100 GPUs, outshines its counterparts in understanding small image regions and describing them with fewer errors. Trained on the GRIT dataset, it excels in referring and grounding tasks, showcasing Apple’s prowess in generative AI and multimodal capabilities.

Ferret’s approach goes beyond textual comprehension, analyzing specific regions of images and incorporating them into queries. This unique integration allows for contextual responses, offering deeper insights into visual content and setting a new standard in AI capabilities.

Ferret’s Impact on Apple Devices

Ferret’s integration into Apple products could revolutionize user experiences. Improved image-based interactions with Siri, advanced visual search functionalities, augmented user assistance for accessibility, and enriched media understanding are among the potential applications. Developers, too, can leverage Ferret’s capabilities for innovative applications across various domains.

Also Read: Apple Prepares for Breakthrough in AI in 2024 with Apple GPT, Ajax, and iOS 18

Challenges and Future Prospects

While Ferret’s potential impact on Apple devices is considerable, scalability poses a challenge. Apple faces questions about its ability to compete with larger models like GPT-4 due to infrastructure limitations. The dilemma prompts strategic decisions, potentially involving partnerships or further embracing open-source principles.

Our Say

Apple’s introduction of Ferret signals a paradigm shift in its AI strategy. The open-source approach invites collaboration and innovation, reflecting a broader commitment to advancing AI technology. As Ferret’s capabilities unfold, it holds the promise of reshaping how we interact with technology, emphasizing a more nuanced understanding of visual content in AI applications.

Sabreena is a GenAI enthusiast and tech editor who's passionate about documenting the latest advancements that shape the world. She's currently exploring the world of AI and Data Science as the Manager of Content & Growth at Analytics Vidhya.

Login to continue reading and enjoy expert-curated content.

Free Courses

Generative AI - A Way of Life

Explore Generative AI for beginners: create text and images, use top AI tools, learn practical skills, and ethics.

Getting Started with Large Language Models

Master Large Language Models (LLMs) with this course, offering clear guidance in NLP and model training made simple.

Building LLM Applications using Prompt Engineering

This free course guides you on building LLM apps, mastering prompt engineering, and developing chatbots with enterprise data.

Improving Real World RAG Systems: Key Challenges & Practical Solutions

Explore practical solutions, advanced retrieval strategies, and agentic RAG systems to improve context, relevance, and accuracy in AI-driven applications.

Microsoft Excel: Formulas & Functions

Master MS Excel for data analysis with key formulas, functions, and LookUp tools in this comprehensive course.

Responses From Readers

Flagship Programs

GenAI Pinnacle Program| GenAI Pinnacle Plus Program| AI/ML BlackBelt Program| Agentic AI Pioneer Program

Free Courses

Generative AI| DeepSeek| OpenAI Agent SDK| LLM Applications using Prompt Engineering| DeepSeek from Scratch| Stability.AI| SSM & MAMBA| RAG Systems using LlamaIndex| Building LLMs for Code| Python| Microsoft Excel| Machine Learning| Deep Learning| Mastering Multimodal RAG| Introduction to Transformer Model| Bagging & Boosting| Loan Prediction| Time Series Forecasting| Tableau| Business Analytics| Vibe Coding in Windsurf| Model Deployment using FastAPI| Building Data Analyst AI Agent| Getting started with OpenAI o3-mini| Introduction to Transformers and Attention Mechanisms

Popular Categories

AI Agents| Generative AI| Prompt Engineering| Generative AI Application| News| Technical Guides| AI Tools| Interview Preparation| Research Papers| Success Stories| Quiz| Use Cases| Listicles

Generative AI Tools and Techniques

GANs| VAEs| Transformers| StyleGAN| Pix2Pix| Autoencoders| GPT| BERT| Word2Vec| LSTM| Attention Mechanisms| Diffusion Models| LLMs| SLMs| Encoder Decoder Models| Prompt Engineering| LangChain| LlamaIndex| RAG| Fine-tuning| LangChain AI Agent| Multimodal Models| RNNs| DCGAN| ProGAN| Text-to-Image Models| DDPM| Document Question Answering| Imagen| T5 (Text-to-Text Transfer Transformer)| Seq2seq Models| WaveNet| Attention Is All You Need (Transformer Architecture) | WindSurf| Cursor

Popular GenAI Models

Llama 4| Llama 3.1| GPT 4.5| GPT 4.1| GPT 4o| o3-mini| Sora| DeepSeek R1| DeepSeek V3| Janus Pro| Veo 2| Gemini 2.5 Pro| Gemini 2.0| Gemma 3| Claude Sonnet 3.7| Claude 3.5 Sonnet| Phi 4| Phi 3.5| Mistral Small 3.1| Mistral NeMo| Mistral-7b| Bedrock| Vertex AI| Qwen QwQ 32B| Qwen 2| Qwen 2.5 VL| Qwen Chat| Grok 3

AI Development Frameworks

n8n| LangChain| Agent SDK| A2A by Google| SmolAgents| LangGraph| CrewAI| Agno| LangFlow| AutoGen| LlamaIndex| Swarm| AutoGPT

Data Science Tools and Techniques

Python| R| SQL| Jupyter Notebooks| TensorFlow| Scikit-learn| PyTorch| Tableau| Apache Spark| Matplotlib| Seaborn| Pandas| Hadoop| Docker| Git| Keras| Apache Kafka| AWS| NLP| Random Forest| Computer Vision| Data Visualization| Data Exploration| Big Data| Common Machine Learning Algorithms| Machine Learning| Google Data Science Agent
👁 Av Logo White

Continue your learning for FREE

Forgot your password?
👁 Av Logo White

Enter OTP sent to

Edit

Wrong OTP.

Enter the OTP

Resend OTP

Resend OTP in 45s

👁 Popup Banner
👁 AI Popup Banner