Summary: In flat-rate AI products, the top 10% of customers by usage typically generate 80% of API costs while paying the same rate as light users. This pattern is visible in GitHub Copilot's margin structure, Cursor's four repricing cycles in 2024, and ChatGPT Pro's unit economics at $200/month.
The Cost Distribution Pattern
Not all customers cost the same. In a typical AI product, the top 10% of users generate 80% of API costs—while paying the same flat rate as everyone else.
Consider a $29/month product with 100 customers. Revenue: $2,900. Then the OpenAI bill arrives: $4,500. The math doesn't work because cost distribution is uneven.
Three Customer Cohorts
Every AI product shows three distinct usage cohorts:
| Cohort | % of Customers | API Cost | Revenue | Margin |
|---|---|---|---|---|
| Light Users | 60% | $5-10/mo | $29/mo | +$20 |
| Average Users | 30% | $25-35/mo | $29/mo | ~$0 |
| Power Users | 10% | $80-150/mo | $29/mo | -$50 to -$120 |
Light users cross-subsidize power users. The question is whether the math works at scale.
Case Studies
GitHub Copilot: -$20 Average Margin
GitHub Copilot charges $10/month with an average margin of -$20/user.
| Cohort | % of Users | API Cost | Margin |
|---|---|---|---|
| Light | 20% | $5/mo | +$5 |
| Average | 60% | $15-20/mo | -$5 to -$10 |
| Power | 20% | $50-80/mo | -$40 to -$70 |
Microsoft's position: subsidize with Azure infrastructure margin rather than raise prices.
Cursor: Four Repricing Cycles in 12 Months
Cursor reached $500M ARR with negative per-customer margins. Their pricing evolution:
- January 2024: $20/month unlimited
- May 2024: $20/month with soft limits (500 completions)
- September 2024: $20/month with hard limits + overages
- December 2024: $200/month "Ultra" tier (June 2025: $71 single-day charges reported)
Each cycle involved customer communication, retention considerations, and support volume.
ChatGPT Pro: Negative at $200/Month
Sam Altman stated publicly that ChatGPT Pro at $200/month shows negative margin on power users. At 20x the base tier price, power users consume 50-100x more resources.
The Margin Formula
The blended margin calculation:
Total Margin = (Light User Margin × 60%) + (Average User Margin × 30%) + (Power User Margin × 10%)
Example with $29/month pricing:
= (+$20 × 0.6) + (-$5 × 0.3) + (-$100 × 0.1)
= +$12 - $1.50 - $10
= +$0.50 per customer
Gross Margin: 1.7% (vs. SaaS benchmark of 70-80%)
One power user joining offsets approximately five light users' contribution. As scale increases, power user percentage often increases, compressing margins further.
If you don't know which cohort each customer falls into, you're pricing blind. See how Bear Lumen tracks per-customer costs →
Pricing Model Options
| Model | How It Works | Examples |
|---|---|---|
| Usage Limits | Cap usage per tier | Cursor (500 requests), Claude (message caps) |
| Tiered Pricing | Higher tiers for heavy users | Copilot ($10/$19), ChatGPT ($20/$200) |
| Hybrid | Base subscription + overages | Vercel, Twilio, AWS |
| Outcome-Based | Charge for results, not usage | Twilio AI (per-ticket), emerging model |
The pattern: flat pricing with variable AI costs requires adjustment as usage distribution becomes visible.
The Market Pattern
AI companies typically follow one of these paths:
- Repriced (Cursor, Notion AI, Jasper)
- Subsidized by other revenue (Microsoft with Copilot)
- Operating with compressed margins
Companies with sustainable margins typically track costs per customer in real-time, model pricing scenarios with current data, and build usage-based infrastructure early.
Key Takeaways
- 60% of customers show positive margin, 10% show significantly negative margin — Visibility into cohort-level margins is essential
- Power users cost 10-100x more than light users — Flat pricing cannot accommodate this variance
- Even $200/month may show negative margin — ChatGPT Pro demonstrates the limit of tier-based pricing
- Proactive pricing changes outperform reactive ones — Communicate early with clear rationale
Know your per-customer margins. Visibility into cohort-level economics enables informed pricing decisions.
Request early access to Bear Lumen →
Related Resources
- The Real Cost of Running an AI Product in 2025 — Complete TCO breakdown
- From Flat-Rate to Usage-Based Pricing Migration — Migration playbook
- GitHub Copilot Margin Analysis — Detailed Copilot economics