AWS lands white-hot gen AI media creation startup fal, becoming its cloud provider of choice



Generative AI’s rapid shift from text-based chatbots to high-definition media—images, video, spatial 3D, and audio—has exposed a glaring bottleneck in today’s tech stack: infrastructure. Rendering pixels in real-time requires a staggering amount of computation, and developers are increasingly struggling to manage sharded GPU clusters just to keep their applications online.

Enter fal, a generative media creation platform that offers hundreds of leading AI image, video and audio creation and editing models, ranging from OpenAI’s ChatGPT-Images-2.0 and Google’s Nano Banana Pro 2 to open source models.

Today, the San Francisco-based startup, which was recently valued at $4.5 billion after a $300 million Series D round led by Sequoia Capital, announced it has chosen Amazon Web Services (AWS) as its preferred cloud provider.

While financial terms of the deal were not disclosed publicly, the move signals a maturation in the generative media space, shifting the focus from simply building basic models to effectively scaling them up for mass, commercial consumption.

“AWS has been there for distribution and monetization, and for the use of AI in creative pursuits – helping designers, developers and the creative community think about how they can use AI responsibly, at scale and globally,” Samira Panah Bakhtiyar, General Manager of Media, Entertainment, Games and Sports at AWS, told VentureBeat in an exclusive interview.

A one-stop shop for Gen AI media that allows businesses to connect and choose the best model for their needs

At its core, fal functions as a single gateway to a rapidly expanding generative AI ecosystem. Instead of forcing developers to provision their own servers, deal with latency issues, or integrate disparate open source model weights, fal provides a single, unified API. Through this API, users get instant access to over 1,000 production-ready AI models.

Think of it as the Stripe or Plaid of generative media: it abstracts away the dauntingly complex back-end plumbing so that developers can focus solely on the user experience.

This “plug and play” solution already attracts independent creators and enterprise giants alike, powering generative workflows for companies including Canva, Adobe and Amazon MGM Studios.
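To make the single-gateway idea above concrete, here is a minimal sketch of the general pattern: one entry point that routes a request to whichever model the caller names. It is not fal’s actual API. The `Gateway` class, the model IDs and the stand-in model functions are all hypothetical, invented purely for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict


# Hypothetical stand-ins for model backends. A real gateway would
# forward the request to remote, GPU-backed inference instead.
def fake_image_model(prompt: str) -> str:
    return f"<image for: {prompt}>"


def fake_video_model(prompt: str) -> str:
    return f"<video for: {prompt}>"


@dataclass
class Gateway:
    """One entry point that dispatches to many models by ID (illustrative only)."""

    registry: Dict[str, Callable[[str], str]]

    def run(self, model_id: str, prompt: str) -> str:
        # The caller only picks a model ID; provisioning and routing
        # stay hidden behind this single call.
        if model_id not in self.registry:
            raise ValueError(f"unknown model: {model_id}")
        return self.registry[model_id](prompt)


# Hypothetical model IDs, not real catalog entries.
gateway = Gateway(
    registry={
        "example/image-model": fake_image_model,
        "example/video-model": fake_video_model,
    }
)

print(gateway.run("example/image-model", "a red bicycle"))
```

In this pattern, swapping one model for another is a one-string change on the caller’s side, which is the appeal of the Stripe-style abstraction.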

“Generative media workloads require a fundamentally different layer of infrastructure that can handle massively parallel inference, rapid model iteration, and production-level reliability at scale,” said Gorkem Yurtseven, CTO and co-founder of fal, in a statement to VentureBeat.

Neither AWS nor fal specified which cloud or GPU providers fal used before the two struck their deal. When asked what fal used before AWS, Bakhtiyar did not name the previous cloud or GPU provider and instead said that fal now uses AWS services.

In a blog post, fal’s head of Compute Partnerships, Emir Lise, described AWS as providing a “global layer of scale and reliability” for fal’s existing serverless generative-media infrastructure, framing the partnership around flexibility, reliability and enterprise scale rather than as a replacement for any incumbent.

A public search turned up Tigris as a storage provider for fal (Tigris says fal runs a “global fleet of GPUs across multiple clouds”), and fal announced in September 2025 that it is available through the Google Cloud Marketplace, allowing customers to purchase fal through Google Cloud billing and management. The listing does not say, however, that Google Cloud supplies fal’s GPU infrastructure.

99.99% guaranteed uptime?

By partnering with AWS, fal aims to combine Amazon’s highly optimized inference engine with global reach to handle millions of daily API calls with a guaranteed uptime of 99.99%.

In addition, Bakhtiyar said that fal users can expect to see "faster results and performance, greater efficiency, greater scalability, and more seamless service continuity: everything you’d expect from a partnership with the world’s largest, most widely adopted cloud."

The main benefit for fal users, then, is better performance and reliability without changing how they work: faster results, greater scalability, smoother continuity, and access to production-ready AI models without managing their own infrastructure.

In fact, the partnership makes the platform even more powerful for creators, studios and enterprise customers by supporting it with AWS security, global scale and cloud infrastructure.

For AWS, it helps push cloud and AI deeper into creative production, not just distribution or monetization. It positions AWS as the go-to infrastructure partner for studios, media companies, developers and individual creators building AI-powered content workflows.
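For a sense of what the 99.99% uptime figure cited above implies, the short calculation below converts an availability percentage into a downtime budget. The article does not say how or over what window the guarantee is measured, so the day, month and year windows here are assumptions chosen only for illustration.

```python
MINUTES_PER_DAY = 24 * 60


def downtime_budget_minutes(availability_pct: float, days: float) -> float:
    """Maximum downtime, in minutes, permitted over `days` at the given availability."""
    return (1 - availability_pct / 100) * days * MINUTES_PER_DAY


# Assumed measurement windows; the source does not specify one.
for label, days in [("day", 1), ("30-day month", 30), ("365-day year", 365)]:
    budget = downtime_budget_minutes(99.99, days)
    print(f"99.99% over one {label}: {budget:.2f} minutes of downtime allowed")
```

At 99.99%, the budget works out to roughly 52.6 minutes per 365-day year, or a little over four minutes per 30-day month.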

Offloading the GPU

The collaboration with AWS is designed to address the sheer physics and cost of rendering generative media. By moving its operations to AWS, fal will be able to take advantage of Amazon’s extensive AI services, including the Bedrock platform and custom-designed silicon such as Trainium and Graviton processors.

"You don’t need to manage like a fleet of GPUs to use artificial intelligence for creative pursuits," Bakhtiyar explained.

This is a critical pain point in 2026, as demand grows for media creation at ever larger scale. Securing high-performance GPUs for parallel inference is both expensive and technically demanding.

By offloading this burden to AWS, fal allows creatives to focus on their own workflows without the need for a dedicated DevOps team.

Bakhtiyar also noted that there is a strong “network effect” to building on AWS. Since major studios and creative platforms (like Adobe and Canva) are already deeply rooted in the AWS ecosystem, integrating fal’s API into their existing pipelines becomes a frictionless task.

Enterprise-grade security and compliance at gen AI creative speed

For IT leaders and developers, fal’s architecture offers distinct advantages in terms of licensing, security and deployment.

Historically, using frontier generative models meant either accepting strict vendor lock-in from a single provider or trying to deploy open-source models locally.

The latter requires significant overhead and forces enterprises to navigate the minefield of various open source licenses (such as MIT, Apache 2.0, or restrictive non-commercial licenses).

fal overcomes this friction by offering commercial API access to a curated ecosystem of models. Developers only pay for the output they consume.

In addition, the platform is SOC 2 compliant and built for “enterprise scale,” meaning it meets the strict data privacy and security criteria required by highly regulated industries and mass consumer platforms.

For large media conglomerates, this managed service approach allows them to securely experiment with the latest tools without the risk of exposing proprietary data or intellectual property.

Empowering developers and vibe coders

However, the true impact of fal’s platform is best seen at the developer level. By democratizing access to high-end infrastructure, fal enables a new class of builders, often called “vibe coders,” to create complex, multimodal applications without a traditional computer science background.

As Bakhtiyar put it, access to these tools fundamentally “levels the playing field.” Whether it’s an individual developer or hobbyist vibe coding a side project, or a fully funded editor or director pitching a blockbuster movie, the underlying technology is now the same: infinitely scalable and production-ready.

“More creatives – whether they’re full-fledged studios, indie brands or individual content creators – will now be able to access these tools, and as a result, they’ll be able to punch well above their weight,” Bakhtiyar said, framing the partnership as a way for fal to serve even more users thanks to the reliability of AWS servers and its custom Trainium, Graviton and Inferentia chips.

The rollout of enhanced AWS capabilities for fal customers will occur in phases throughout 2026.


