Monarch Early Access

Access and manage powerful GPUs.

We're announcing the early access launch of Monarch, a unified platform that streamlines managing AI infrastructure across multiple cloud providers. Monarch is designed specifically for AI researchers, startups, and enterprise teams who need efficient, cost-effective, and scalable GPU computing without the hassle of juggling multiple platforms and APIs.

Why Monarch?

Currently, AI teams face a challenging and fragmented ecosystem. The realities of working with multiple cloud providers—each with unique interfaces, pricing models, and availability—create inefficiencies and overhead. Monarch directly addresses this pain point by providing a single, coherent API and intuitive dashboard that aggregates GPU compute resources across major providers such as AWS, Google Cloud, Azure, Vast, RunPod, Paperspace, and more.

One of Monarch’s core strengths is real-time GPU resource discovery. Instead of manually checking each cloud provider, users instantly see which GPUs are available, their detailed specifications—including memory, type, and location—and real-time pricing comparisons. This capability enables informed, cost-effective decisions, giving teams confidence that they're always utilizing the most economical and powerful GPU resources for their specific tasks.

Users can effortlessly submit and manage model training jobs directly through the platform’s streamlined API. Monarch intelligently selects the optimal cloud provider based on current costs and availability, automatically routing workloads to the best-suited GPUs. As training progresses, users receive real-time updates and transparent monitoring, ensuring complete visibility into their job statuses without having to interface directly with provider-specific dashboards.
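As a sketch of what submitting a job through such an API might look like, the snippet below assembles a hypothetical job request. The field names and values here are illustrative assumptions, not Monarch's actual schema; the point is that the request describes the workload and constraints, and the platform handles provider selection.

```python
import json

# Hypothetical job payload -- field names are illustrative,
# not Monarch's documented schema.
def build_training_job(image, command, gpu_type, max_hourly_cost):
    """Assemble a job request that leaves provider selection to the scheduler."""
    return {
        "image": image,                      # container with your training code
        "command": command,                  # entrypoint inside the container
        "gpu_type": gpu_type,                # e.g. "A100-80GB"
        "max_hourly_cost": max_hourly_cost,  # budget cap for automatic routing
    }

job = build_training_job(
    image="ghcr.io/example/trainer:latest",
    command=["python", "train.py", "--epochs", "10"],
    gpu_type="A100-80GB",
    max_hourly_cost=2.50,
)
payload = json.dumps(job)  # body for a hypothetical POST /jobs request
```

Because the platform owns routing, the request never names a cloud provider; it only states what the job needs and what it may cost.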

How to Start?

To get started, simply sign up for early access. Upon joining, you'll receive a secure API key that grants you access to Monarch’s infrastructure capabilities. Capacity during early access may be limited, and we will prioritize early signups and workloads that align with our early access program goals.

Once you have your API key, interacting with Monarch is as straightforward as making simple REST API requests. With each query, you can instantly retrieve detailed and up-to-date information about available GPU resources across various cloud providers.
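A minimal request might look like the following. Note that the base URL, endpoint path, and bearer-token header scheme are assumptions for illustration; check the early access documentation for the actual values.

```python
import urllib.request

# Hypothetical base URL and auth scheme -- consult the early access
# docs for Monarch's real endpoint and header format.
API_BASE = "https://api.monarch.example/v1"
API_KEY = "your-api-key"

def list_gpus_request():
    """Build (but do not send) an authenticated request for available GPUs."""
    return urllib.request.Request(
        f"{API_BASE}/gpus",
        headers={"Authorization": f"Bearer {API_KEY}"},
        method="GET",
    )

req = list_gpus_request()
# urllib.request.urlopen(req) would return the JSON listing of offers.
```

Any HTTP client works here; the only moving parts are the endpoint and the API key you received at signup.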

Monarch’s API not only returns basic GPU details—it provides comprehensive specifications including GPU type, cloud provider, memory capacity, geographic region, instance type, and precise on-demand and spot pricing. This ensures that every decision you make is fully informed, eliminating guesswork and maximizing cost-efficiency.
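Given a listing with those fields, choosing a GPU becomes a simple filter-and-sort. The rows below are made-up sample data in the shape just described (not live Monarch prices), but the comparison logic is the same either way:

```python
# Illustrative offers in the shape described above; these prices are
# sample values, not live Monarch data.
offers = [
    {"gpu": "A100-80GB", "provider": "aws",    "memory_gb": 80,
     "region": "us-east-1",   "on_demand": 4.10, "spot": 1.64},
    {"gpu": "A100-80GB", "provider": "runpod", "memory_gb": 80,
     "region": "eu-west",     "on_demand": 2.19, "spot": 1.19},
    {"gpu": "H100",      "provider": "gcp",    "memory_gb": 80,
     "region": "us-central1", "on_demand": 6.98, "spot": 2.25},
]

def cheapest(offers, min_memory_gb, use_spot=False):
    """Return the lowest-priced offer meeting the memory requirement."""
    price = "spot" if use_spot else "on_demand"
    eligible = [o for o in offers if o["memory_gb"] >= min_memory_gb]
    return min(eligible, key=lambda o: o[price])

best = cheapest(offers, min_memory_gb=80)
```

With per-offer spot and on-demand prices side by side, the same few lines answer "cheapest GPU with at least X GB" for any workload.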

In summary:

  1. Sign Up for Access: You'll receive an API key upon joining the early access program.

  2. Make API Requests: Use a simple REST API to query available GPUs and their specifications.

  3. Compare Options: See pricing and specs side-by-side to make informed decisions.

  4. Select Your Resources: Choose the best GPU option for your specific workload.

With flexible pricing tiers tailored to individual researchers, growing startups, and established enterprise teams, Monarch ensures predictable and optimized costs regardless of scale.

Which GPUs?

Monarch currently offers comprehensive access to a diverse range of GPU types, ensuring that you can precisely match the needs of your workload with optimal compute power. Our platform spans both cutting-edge and widely-used GPUs, catering to a variety of training, inference, and experimentation requirements.

Available GPUs include:

Latest NVIDIA GPUs:

  • H100: Exceptional performance for large-scale AI training, heavy inference workloads, and high-end experimentation.

  • A100 & A100-80GB: Ideal for demanding AI training, large models, and intensive deep learning tasks.

  • L40S & L4: Optimized for fast, efficient inference and real-time AI applications.

  • RTX 6000 Ada & RTX 5090: Advanced RTX GPUs designed for next-generation AI and graphics workloads.

Professional GPUs:

  • NVIDIA A4000, A5000, A6000: Robust professional GPUs providing a balanced combination of power, efficiency, and stability for reliable AI workloads, graphic design, and professional video rendering.

Consumer GPUs:

  • RTX 3080, RTX 3090, RTX 2080 Ti: Powerful and affordable GPUs that effectively balance cost and performance, making them ideal for rapid prototyping, fine-tuning experiments, and smaller-scale AI workloads.

Beyond GPU variety, Monarch’s infrastructure is strategically distributed. Our providers offer GPU availability across multiple global regions, including extensive coverage in North America, Europe, and Asia. This international distribution allows you to select compute resources close to your end-users or data centers, significantly reducing latency, supporting data sovereignty requirements, and optimizing performance.

Monarch also offers access to local GPUs based directly within Africa. This regional infrastructure not only addresses latency and regulatory concerns but also provides critical support for AI projects focused specifically on African languages, datasets, and context. By offering local compute, Monarch is actively reshaping the landscape of AI infrastructure availability across the continent.

We provide both global reach and local depth, ensuring you have precise control and flexibility over your GPU resources, wherever your users and data may be.

Who Should Join?

Monarch is designed to cater to specific AI-driven use cases, making it the ideal solution for those working on innovative and performance-critical AI workloads.

AI researchers benefit immensely from Monarch’s extensive GPU catalog and streamlined resource management. By accessing the most cost-effective GPUs—including the latest NVIDIA hardware—researchers can significantly reduce both experimentation time and infrastructure costs.

Startups running machine learning workloads find Monarch particularly valuable because it simplifies complex GPU infrastructure management, saving critical engineering resources. Instead of investing significant time and capital in infrastructure setup and management, startups can focus purely on their product and innovation, scaling GPU resources seamlessly as their business grows.

Developers building and testing AI applications benefit from Monarch’s unified API, enabling effortless deployment, testing, and fine-tuning across multiple GPU types and providers. This flexibility dramatically shortens development cycles, improves reliability, and ensures developers spend their time building great software rather than navigating complex infrastructure configurations.

Data scientists training models on various GPU configurations find Monarch especially powerful. With comprehensive access to diverse GPU architectures—ranging from consumer-grade GPUs ideal for rapid experimentation to advanced NVIDIA H100 and A100 GPUs suitable for massive model training—data scientists can optimize performance, costs, and outcomes simultaneously.

Join the Program

Ready? Join our early access program today to get priority access and help shape the future of Monarch.

Simply fill out our sign-up form to get started. We'll be accepting a limited number of users during this phase to ensure we can provide the best possible experience.

We're looking forward to having you on board!
