Google Cloud enables self-deployment of proprietary AI models in customer VPCs

Google Cloud has announced that users can now self-deploy proprietary AI models from partners such as Mistral AI and CAMB.AI within their own Virtual Private Cloud (VPC) on Vertex AI. This update allows streaming and tech enterprises to run closed-source models for various tasks, including multilingual text-to-speech, while maintaining data sovereignty and VPC security policies. The models are available through the Vertex AI Model Garden, offering a curated catalog for discovery, testing, and deployment.

Key Takeaways

Users can now self-deploy proprietary models from eight partners, including Mistral AI and CAMB.AI, directly into their VPC.
This capability supports closed-source models and those with restricted commercial licenses, maintaining adherence to VPC-SC policies.
The Vertex AI Model Garden serves as a curated catalog for discovering, testing, and deploying these models, which include AI21 Labs' Jamba Large 1.6 and CAMB.AI's MARS7 for hyper-realistic TTS.
Organizations can optimize performance or cost by scaling deployments manually or with auto-scaling, and select Google Cloud regions for data compliance.

Why It Matters

This move by Google Cloud significantly addresses enterprise demand for greater control and security over AI model deployment, particularly for sensitive streaming data. Allowing proprietary models within a customer's VPC mitigates data residency and compliance concerns, which are critical for media companies handling vast amounts of user information and localized content. It enables faster adoption of advanced AI capabilities, like multilingual voice cloning, without compromising data security. The next indicator will be the rate at which major streaming platforms adopt this self-deployment feature for their AI workflows, signaling a shift in cloud AI strategy.

Read full article at cloud.google.com

Get this in your inbox → Subscribe

Enjoy our coverage?

Add StreamingMeme as a preferred source on Google to see more of our streaming news at the top of your Search results.

Add as preferred source

Content+Technology: Runway launches Media Router to automate generative video model selection

X: vLLM v0.26.0 introduces tiered KV offloading and multimodal audio-video support

IT Brief UK: Fetch.ai and RedSquid TV launch first agentic AI television platform

Google Cloud enables self-deployment of proprietary AI models in customer VPCs

Key Takeaways

Users can now self-deploy proprietary models from eight partners, including Mistral AI and CAMB.AI, directly into their VPC.
This capability supports closed-source models and those with restricted commercial licenses, maintaining adherence to VPC-SC policies.
The Vertex AI Model Garden serves as a curated catalog for discovering, testing, and deploying these models, which include AI21 Labs' Jamba Large 1.6 and CAMB.AI's MARS7 for hyper-realistic TTS.
Organizations can optimize performance or cost by scaling deployments manually or with auto-scaling, and select Google Cloud regions for data compliance.

Why It Matters

Read full article at cloud.google.com

Google Cloud enables self-deployment of proprietary AI models in customer VPCs

Key Takeaways

Why It Matters

Enjoy our coverage?

Related Articles

Google Cloud enables self-deployment of proprietary AI models in customer VPCs

Key Takeaways

Why It Matters

Enjoy our coverage?

Related Articles

Newest

Upcoming Events

Top Sources

Newest

Upcoming Events

Top Sources

Related Articles

Runway launches Media Router to automate generative video model selection

vLLM v0.26.0 introduces tiered KV offloading and multimodal audio-video support

Fetch.ai and RedSquid TV launch first agentic AI television platform