VOOZH about

URL: https://thenewstack.io/with-gemini-pro-google-vies-for-top-spot-in-genai-race/

⇱ With Gemini Pro, Google Vies for Top Spot in GenAI Race - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2023-12-14 13:51:53
With Gemini Pro, Google Vies for Top Spot in GenAI Race
AI / Large Language Models / Software Development

With Gemini Pro, Google Vies for Top Spot in GenAI Race

Google's Gemini Pro release brings the power of Google's new Gemini AI model to developers.
Dec 14th, 2023 1:51pm by Darryl K. Taft
👁 Featued image for: With Gemini Pro, Google Vies for Top Spot in GenAI Race
Image via Unsplash.

Having delivered its Gemini Pro multimodal AI model, Google showed that it’s nowhere near an also-ran in the AI race; it’s a potential top dog.

Google has made its Gemini Pro AI model available to developers to begin building applications using the company’s newly announced large language model (LLM).

Last week, Google announced Gemini as its most sophisticated LLM to date; this week, the search giant opened up access to the Pro version. Gemini comes in three flavors: Ultra for very complex applications, Pro for individual developers and enterprises, and Nano for mobile (Android — Pixel 8 Pro) environments. Also, a specifically tuned version of Gemini Pro is in Bard, Google’s generative AI chatbot.

Gemini Pro is now available in preview supported by Google AI Studio, a new, web-based development environment, as well as Vertex AI, Google’s enterprise AI platform. Developers can start building applications with Gemini through these tools.

“Google certainly has the technical chops and AI infrastructure to build world-class generative AI models,” Mike Gualtieri, an analyst at Forrester Research, told The New Stack via email. “I would bet on Google Gemini to best OpenAI and most other models because of their vast repository of content that updates frequently and lower cost.”

Exposure to Developers

Google exposes Gemini Pro through a set of APIs that developers can use to access the model, and then makes it available to the two different tools — Google AI Studio for individual developers and small development teams, and Vertex AI for enterprises who have more complex and sophisticated needs, said Thomas Kurian, CEO of Google Cloud Platform, during a press briefing about the products.

“A developer can start in Google AI Studio, and if they have more needs, they can also move to Vertex after they’ve started with Google AI Studio,” he noted. Vertex AI features more than 130 models in its Model Garden.

Google AI Studio is a free tool that enables developers to quickly develop prompts and then get an API key to use in their app development. It’s free to use right now, within limits, and it will be competitively priced, Google said.

During the press briefing, Josh Woodward, vice president of Google Labs, demonstrated how to sign into Google AI Studio with a Google account and take advantage of the free quota that allows 60 requests per minute, which he said is 20 times more than other free offerings.

Woodward also demonstrated how a developer can simply click on “Get code” to transfer their work to their integrated development environment of choice or use one of the quickstart templates available in Android StudioColab or Project IDX.

“We’ll be further fine-tuning it in the weeks and months ahead as we listen and learn from your feedback, wrote Jeanine Banks, vice president and general manager, Developer X and DevRel at Google, and Burak Gokturk, Google’s vice president and general manager, Cloud AI and Industry Solutions, in a blog post about Gemini Pro.

“To help us improve product quality, when you use the free quota, your API and Google AI Studio input and output may be accessible to trained reviewers. This data is de-identified from your Google account and API key,” the post said.

Meeting Developers Where They Are

Google provides SDKs for Gemini Pro to help developers build applications that run pretty much anywhere, with support for Python, Kotlin, Node.jsSwift and JavaScript.

“One of the things that has been consistent about our strategy for developers is how we can meet them where they are,” Banks told The New Stack in an interview. “And doing that means wherever their experience lies with AI, whatever level of expertise they’ve had as developers.”

Moreover, Gemini Pro accepts text as input and generates text as output, and provides a dedicated Gemini Pro Vision multimodal endpoint that accepts text and imagery as input, with text output.

Today’s version comes with a 32K context window for text, and future versions will have a larger context window.

In addition, Kurian said, Gemini Pro comes with a range of features including function calling, embeddings, semantic retrieval and custom knowledge grounding, and chat functionality.

Hybrid Infrastructure

Gemini was trained using Google’s latest Tensor Processing Units (TPU) across multiple data centers and clusters to improve scale and resilience. However, details on the data centers and clusters were not disclosed.

Gemini is part of a broad AI hybrid computing infrastructure. Google introduced the fifth generation of its TPU which provides significantly faster model training for existing models, much better scale for next-generation models, and doubles the number of floating point operations per second (FLOPS) per chip, Kurian said.

Gemini and other models from Google, run on an ultrascale AI infrastructure, he said. It has very sophisticated hardware, compute, and different kinds of storage, interconnected with a very low latency, and a very wide optical switching network, which allows for “extremely good” performance and throughput, he added.

“On top of this, we expose a managed software stack to make it easy for people building models and using our models to use this infrastructure,” Kurian said during the briefing. “It starts with two flavors of managed compute, Google Compute Engine and Google Kubernetes Engine.

“We offer a variety of different styles of workloads. If somebody wants to do a long training run, they can do batch, but if they want to split the work across multiple clusters in multiple data centers, we call that multihost. If they want to dynamically assemble slices of infrastructure from different systems, we call that multislice.”

Google also optimized the most popular AI frameworks and libraries, including TensorFlow and PyTorch, on this infrastructure.

“It’s important if you have the tools and you have the SDKs, it’s critically important to have the best-performing infrastructure,” Banks told TNS.

Real Competition

Competitively, developers want powerful models, scalable infrastructure, good tooling and APIs connected to what they already use. Google aims to provide all three, which gives the company an edge, Woodward said in a video interview.

“Gemini Pro is just a natural extension of Gemini for developers to start building applications,” Gualtieri said. “The real competition though will be open source models downloaded from HuggingFace. There are certainly use cases for megamodels, but there will be tens of thousands of small expertise-specific models — just like human expertise. You don’t need a model to summarize Shakespeare if your use case is commercial real estate leasing contracts.”

Pricing for Gemini Pro is reduced per character to attract more developers, while indemnification coverage has been expanded, Woodward said.

Pricing for Gemini Pro is $0.00025 per 1,000 characters and $0.0025 per image for input, and $0.0005 per 1,000 characters for output.

Google also announced new integrations and capabilities, including Gemini powering Google’s real-time collaboration assistant Duet AI in Workspace, and new AI-powered programming assistants and security tools in Cloud.

Accenture Partnership

Meanwhile, Accenture and Google Cloud launched a new initiative to help organizations adopt generative AI to improve operations, create new lines of business, and more.

The companies said they will create a global, joint generative AI Center of Excellence that will help businesses build and scale applications using Google Cloud’s generative AI portfolio, Accenture officials said.

“As a catalyst for business reinvention, generative AI will transform how people work and access information,” said Karthik Narain, group chief executive of technology at Accenture, in a statement. “Organizations want to move from experimentation with generative AI to scaled implementations faster. Accenture’s deep expertise in managing and scaling large language models tailored for business needs, paired with tools like Accenture’s model switchboard, can help accelerate adoption.”

Accenture itself has more than 300 scaled generative AI projects and AI solutions, and more than 1,450 AI patents and patent-pending applications.

Indeed, some of the major systems and service provider’s customers are looking for help with their generative AI efforts. Blue Cross is one such enterprise customer.

“At Independence Blue Cross, we’ve harnessed the power of AI to better manage massive amounts of data and show opportunities to improve member experiences and introduce new services,” said Michael R. Vennera, executive vice president and chief strategy, corporate development and information officer, Independence Health Group, said in a statement.

“We are looking forward to working with Accenture and Google Cloud through their new Center of Excellence to explore new ways generative AI can help proactively manage our members’ health.”

TRENDING STORIES
Darryl K. Taft covers DevOps, software development tools and developer-related issues from his office in the Baltimore area. He has more than 25 years of experience in the business and is always looking for the next scoop. He has worked...
Read more from Darryl K. Taft
SHARE THIS STORY
TRENDING STORIES
TNS owner Insight Partners is an investor in: Real, OpenAI.
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.