Deep dives on provider economics, GPU selection, pricing trends, and what it all actually means for your budget. Written by practitioners, not analysts.
H100 availability, pricing trends, neocloud vs hyperscaler dynamics, and what's changed in the past 12 months. Essential reading for anyone buying compute in 2026.
Spot and preemptible instances can cut your GPU bill by 60–90%. But interruptions, checkpointing overhead, and availability uncertainty make the calculus non-obvious. We break it down.
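To make that calculus concrete, here's a minimal sketch of effective cost per useful GPU-hour on spot capacity. Every number below — rates, interruption frequency, checkpoint overhead — is an illustrative assumption, not a provider quote.

```python
# Sketch: when does a spot discount beat on-demand once checkpointing
# overhead and lost work from interruptions are priced in?

def effective_spot_cost(on_demand_rate: float,
                        spot_discount: float,
                        interruptions_per_day: float,
                        lost_minutes_per_interruption: float,
                        checkpoint_overhead_pct: float) -> float:
    """Effective $ per *useful* GPU-hour on spot capacity."""
    spot_rate = on_demand_rate * (1 - spot_discount)
    lost_hours_per_day = interruptions_per_day * lost_minutes_per_interruption / 60
    useful_fraction = (24 - lost_hours_per_day) / 24 * (1 - checkpoint_overhead_pct)
    return spot_rate / useful_fraction

# Illustrative: $4/hr on-demand, 70% spot discount, 2 interruptions/day,
# 20 minutes of lost work each, 3% of runtime spent checkpointing.
cost = effective_spot_cost(4.00, 0.70, 2, 20, 0.03)
print(f"${cost:.2f} per useful GPU-hour vs $4.00 on-demand")
```

Even with pessimistic interruption assumptions, the discount usually survives — the real risk is availability drying up mid-run, which no formula captures.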
In June 2025, AWS made an aggressive price cut on H100 instances. We look at what drove it, how competitors responded, and what it signals for 2026 pricing trends.
Most teams buy for training, then discover that inference has completely different requirements. GPU choice, networking, cost structure — here's what changes when you move to production.

The H100 is 2–3× faster for transformers, but costs 60–80% more per hour. We model out real training runs to show when you should upgrade — and when you're overpaying for headroom.
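The break-even is simple arithmetic. A minimal sketch, assuming placeholder hourly rates and a range of speedups (none of these figures are real quotes):

```python
# Sketch: per-run training cost for a baseline GPU vs the H100,
# under an assumed hourly premium and speedup. Rates are placeholders.
def run_cost(hourly_rate: float, baseline_hours: float, speedup: float = 1.0) -> float:
    return hourly_rate * baseline_hours / speedup

baseline_rate, h100_rate = 2.00, 3.50   # assumed $/GPU-hour
baseline_hours = 1000                   # wall-clock on the baseline GPU
for speedup in (1.5, 2.0, 3.0):         # assumed H100 speedup on the workload
    print(f"{speedup}x speedup: baseline ${run_cost(baseline_rate, baseline_hours):,.0f}"
          f" vs H100 ${run_cost(h100_rate, baseline_hours, speedup):,.0f}")

# Break-even: the H100 wins whenever its speedup on your workload
# exceeds its price premium (here 3.50 / 2.00 = 1.75x).
```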
Most teams either dramatically over-buy or run out of compute mid-training. This guide walks through a real methodology for estimating compute needs from model size, dataset size, and timeline.
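The back-of-envelope version of that methodology fits in a few lines, using the common C ≈ 6·N·D FLOPs approximation for dense transformers. The peak-throughput and utilization numbers below are assumptions — plug in your own hardware and measured MFU.

```python
# Sketch: rough training compute from model size and dataset size.
# C ≈ 6 * params * tokens (dense transformer forward+backward).

def gpu_hours(params: float, tokens: float,
              peak_flops: float = 990e12,   # assumed BF16 dense peak per GPU
              mfu: float = 0.40) -> float:  # assumed model FLOPs utilization
    total_flops = 6 * params * tokens
    return total_flops / (peak_flops * mfu * 3600)

# Illustrative: a 7B-parameter model trained on 1.4T tokens.
hours = gpu_hours(7e9, 1.4e12)
print(f"~{hours:,.0f} GPU-hours")
```

Multiply by your hourly rate for a budget floor — then add headroom for restarts, ablations, and the runs that don't work.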
The right pricing model depends on your workload type, fault tolerance, and budget. We walk through a decision tree that most procurement teams can actually use.
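A decision tree like that can be sketched in a few lines. The branches and thresholds here are our illustrative assumptions, not the full tree from the article:

```python
# Toy decision tree: workload traits -> pricing model.
# Thresholds are assumptions, not procurement advice.
def pricing_model(fault_tolerant: bool, steady_utilization: bool,
                  horizon_months: int) -> str:
    if fault_tolerant and not steady_utilization:
        return "spot/preemptible"    # bursty + checkpointable: chase discounts
    if steady_utilization and horizon_months >= 12:
        return "reserved/committed"  # predictable load: commit for the discount
    return "on-demand"               # everything else: pay for flexibility

print(pricing_model(fault_tolerant=True, steady_utilization=False, horizon_months=3))
```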
Egress fees, storage, data transfer, support plans, and tooling costs — hyperscaler list prices rarely tell the full story. Here's what actually shows up on the bill.
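A quick illustration of the gap between list price and the actual bill for a single 8-GPU node. Every line item below is an assumed placeholder, not a real quote:

```python
# Sketch: list price vs all-in monthly bill. All figures are assumptions.
items = {
    "8x GPU instance (list)": 8 * 4.00 * 730,        # $/GPU-hr x hrs/month
    "egress (10 TB @ $0.09/GB)": 10_000 * 0.09,
    "object storage (50 TB @ $0.023/GB)": 50_000 * 0.023,
    "support plan": 1_000,
}
total = sum(items.values())
for name, dollars in items.items():
    print(f"{name}: ${dollars:,.0f}")
print(f"all-in: ${total:,.0f} vs list ${items['8x GPU instance (list)']:,.0f}")
```

The non-GPU line items look small here, but egress in particular scales with your data movement, not your GPU count — it can dominate for inference-heavy workloads.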
Neoclouds are significantly cheaper and often faster to provision. But hyperscalers offer broader ecosystems and compliance coverage. Here's how to decide.
Both are GPU-native neoclouds with strong H100 availability, but they serve different buyers in subtle ways. We compare reliability, pricing models, networking, and which teams each one fits.
Vast.ai's marketplace pricing is hard to beat. But the "community" nature of the supply raises real questions about reliability and support. We look at the use cases where it shines and the risk profile you're accepting.
AWS is rarely the cheapest option. But there are real scenarios where paying the premium is justified — compliance, data locality, VPC integration. We map them out.
The difference between SXM and PCIe variants isn't just clock speed — it's interconnect bandwidth. For large model training, this can be the difference between linear and sublinear scaling.
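To see why the interconnect matters, here's a rough sketch of gradient-sync time under ring all-reduce, which moves about 2·(n−1)/n of the payload per GPU. The bandwidth figures are approximate assumptions for per-direction throughput:

```python
# Sketch: gradient all-reduce time vs interconnect bandwidth.
# Ring all-reduce transfers ~2*(n-1)/n * payload per GPU.
def allreduce_seconds(grad_bytes: float, n_gpus: int, bw_bytes_per_s: float) -> float:
    return 2 * (n_gpus - 1) / n_gpus * grad_bytes / bw_bytes_per_s

grad = 7e9 * 2  # 7B params of bf16 gradients
for name, bw in (("NVLink (SXM, ~450 GB/s assumed)", 450e9),
                 ("PCIe Gen5 x16 (~64 GB/s assumed)", 64e9)):
    t = allreduce_seconds(grad, 8, bw)
    print(f"{name}: {t:.2f} s per sync")
```

A ~7× gap per sync, repeated every optimizer step, is exactly how PCIe boxes end up scaling sublinearly on communication-bound training.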
Choosing the wrong parallelism strategy wastes GPU hours. This guide breaks down the tradeoffs — and shows the compute and networking cost implications of each approach.
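One of the simplest levers in that tradeoff is weight memory per GPU. A minimal sketch, ignoring optimizer state and activations (the model size and parallelism degrees below are illustrative):

```python
# Sketch: bf16 weight memory per GPU under tensor (TP) and
# pipeline (PP) parallelism. Optimizer/activation memory omitted.
def weights_per_gpu_gb(params: float, bytes_per_param: int, tp: int, pp: int) -> float:
    return params * bytes_per_param / (tp * pp) / 1e9

for tp, pp in ((1, 1), (8, 1), (8, 4)):
    gb = weights_per_gpu_gb(70e9, 2, tp, pp)
    print(f"TP={tp}, PP={pp}: {gb:.1f} GB of bf16 weights per GPU")
```

The catch the article digs into: every increase in TP buys memory headroom at the cost of per-layer communication, which is why TP usually stays within a single NVLink domain.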
Lower precision means faster training and cheaper compute. But it also means more careful implementation. Here's the practical breakdown of when each precision makes sense.
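The memory side of that tradeoff is easy to quantify. A sketch of weight footprint by precision for an assumed 7B-parameter model (weights only — gradients and optimizer state multiply this further):

```python
# Sketch: parameter-memory footprint at different precisions.
BYTES = {"fp32": 4, "bf16": 2, "fp8": 1}
params = 7e9  # illustrative model size

for precision, bytes_per_param in BYTES.items():
    print(f"{precision}: {params * bytes_per_param / 1e9:.0f} GB of weights")
```

Halving the bytes per parameter is the easy part; the careful-implementation part is loss scaling, accumulation precision, and knowing which layers to keep in higher precision.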