External Memory Series — File-based memory for AI-assisted work (overview · 1 Implementation · 2 Productivity · 3 vs the diagram · 4 Governance)
The Problem: Copilot's Pricing Model Just Changed
GitHub Copilot is moving from request-based billing to token-based billing on June 1, 2026. If you're a heavy coder on Copilot Pro+ at $39/month, you may have already hit your 1,500 premium request allowance by mid-month. The question isn't whether Copilot is useful — it's whether the new pricing model still makes sense once you compare it against direct API access through OpenRouter, or against the much cheaper (and riskier) third-party relays that have appeared on the market.
The answer depends entirely on how you code, and on how much trust you're willing to extend to non-official providers.
A note on numbers. All prices in this article are expressed in US dollars per 1 million tokens. Third-party relay prices originally shown in Chinese renminbi (¥) have been converted at approximately ¥7.10 = $1.00 USD so you can compare apples to apples.
Why This Matters
The shift from "requests per month" to "tokens consumed" fundamentally changes the economics of AI-assisted coding. Under the old model, a complex refactor and a simple autocomplete both counted as one premium request. Under the new model, that refactor might consume 50,000 tokens while the autocomplete uses 200 — and you pay accordingly.
For developers who:
- run agent-based coding workflows
- work with large repository contexts
- use Copilot code review extensively
- prefer GPT-5.5 or Claude Opus 4.7 for complex tasks
…the token-based model can quietly exceed the value of the $39/month subscription within days.
What GitHub Copilot Actually Costs Now
Starting June 1, 2026, GitHub Copilot charges by tokens, converted to GitHub AI Credits where 1 AI credit = $0.01 USD. (GitHub docs)
Current Plan Structure
| Plan | Price | Included Allowance |
|---|---|---|
| Copilot Free | $0 | 50 requests/month, 2,000 completions/month |
| Copilot Pro | $10/month | 300 premium requests/month |
| Copilot Pro+ | $39/month | 1,500 premium requests/month |
Once that allowance is exhausted, you're paying per token at the rates below.
New Token-Based Pricing (Effective June 1, 2026)
All prices in USD per 1 million tokens. Verified against the GitHub Copilot pricing tables on May 24, 2026.
OpenAI models:
| Model | Input | Cached Input | Output |
|---|---|---|---|
| GPT-4.1 | $2.00 | $0.50 | $8.00 |
| GPT-5 mini | $0.25 | $0.025 | $2.00 |
| GPT-5.2 | $1.75 | $0.175 | $14.00 |
| GPT-5.3-Codex | $1.75 | $0.175 | $14.00 |
| GPT-5.4 | $2.50 | $0.25 | $15.00 |
| GPT-5.4 mini | $0.75 | $0.075 | $4.50 |
| GPT-5.5 | $5.00 | $0.50 | $30.00 |
Anthropic models (Anthropic adds a separate cache-write cost):
| Model | Input | Cached Input | Cache Write | Output |
|---|---|---|---|---|
| Claude Haiku 4.5 | $1.00 | $0.10 | $1.25 | $5.00 |
| Claude Sonnet 4.5 | $3.00 | $0.30 | $3.75 | $15.00 |
| Claude Sonnet 4.6 | $3.00 | $0.30 | $3.75 | $15.00 |
| Claude Opus 4.7 | $5.00 | $0.50 | $6.25 | $25.00 |
Important: Inline code completions and next-edit suggestions remain unlimited on paid plans and do not consume AI credits. The new pricing only applies to chat, agent mode, code review, and other token-consuming features.
How OpenRouter and Third-Party Relays Compare
This is where the comparison gets interesting — and where you need to be careful about what you're buying.
OpenRouter: Roughly Pass-Through Pricing
OpenRouter is a model gateway that routes to official providers. Its prices generally track Anthropic and OpenAI's official rates within a few percent. For example, Claude Sonnet 4.5/4.6 on OpenRouter is around $3.00 input / $15.00 output per 1M tokens — essentially the same as Copilot's token rate.
So if you're comparing Copilot's per-token cost against OpenRouter for the same model, they're roughly equivalent. The savings from OpenRouter come from skipping the subscription, not from cheaper tokens.
Third-Party Relays (e.g. OpenAI-Hub): Dramatically Cheaper, Materially Riskier
A separate category of third-party services advertises prices that are 5–10× lower than official rates. Here's what one such relay (OpenAI-Hub) shows, converted to USD:
GPT-5.5
| Provider | Input | Output |
|---|---|---|
| GitHub Copilot | $5.00 / 1M | $30.00 / 1M |
| OpenRouter (approx.) | $5.00 / 1M | $30.00 / 1M |
| OpenAI-Hub relay | ~$0.42 / 1M (¥3.00) | ~$2.54 / 1M (¥18.00) |
| Relay is ~12× cheaper than Copilot |
GPT-5.4
| Provider | Input | Output |
|---|---|---|
| GitHub Copilot | $2.50 / 1M | $15.00 / 1M |
| OpenAI-Hub relay | ~$0.21 / 1M (¥1.50) | ~$1.27 / 1M (¥9.00) |
| Relay is ~12× cheaper than Copilot |
Claude Sonnet 4.6
| Provider | Input | Output |
|---|---|---|
| GitHub Copilot | $3.00 / 1M | $15.00 / 1M |
| OpenAI-Hub relay | ~$0.42 / 1M (¥3.00) | ~$2.11 / 1M (¥15.00) |
| Relay is ~7× cheaper than Copilot |
Claude Opus 4.7
| Provider | Input | Output |
|---|---|---|
| GitHub Copilot | $5.00 / 1M | $25.00 / 1M |
| OpenAI-Hub relay | ~$0.70 / 1M (¥5.00) | ~$3.52 / 1M (¥25.00) |
| Relay is ~7× cheaper than Copilot |
The Caveat That Matters More Than the Price
Prices this far below official rates are not a sign of clever procurement. They're a signal. Possible explanations include:
- Bulk-purchased API quotas resold at thin margins — legitimate but fragile, and may breach the upstream provider's terms of service.
- Shared or pooled API keys — your prompts may not be isolated from other tenants.
- Downgraded or quantized model variants delivered under the headline name.
- Routing through jurisdictions with different data-handling rules than Anthropic or OpenAI default to.
Before you put production code, customer data, or proprietary IP through a relay like this, you need clear answers on data retention, model authenticity, and uptime guarantees. For most professional work, the price advantage does not survive contact with security review.
When Copilot Is Still the Right Choice
GitHub Copilot remains the better value when:
- You rely heavily on autocomplete. Inline completions stay unlimited on paid plans and don't consume AI credits.
- Your usage is light to moderate. If you stay within the 1,500 premium requests on Pro+, you're paying $39 for IDE integration, all major models, and code review.
- You value the integrated experience. Native VS Code, JetBrains, GitHub PRs, CLI, and agent mode with no configuration to manage.
- You need a single billing relationship. One invoice, one vendor, one DPA — significant when working inside an enterprise.
When OpenRouter Makes Sense
OpenRouter becomes the better choice when:
- You run agent-based workflows outside Copilot. Cursor, Cline, Continue, and custom agents typically integrate with OpenRouter cleanly.
- You want per-task model choice without lock-in. OpenRouter lets you pick between Anthropic, OpenAI, Google, Mistral, and dozens of others under one API key.
- You've outgrown the Pro+ allowance. If you're consistently exhausting 1,500 premium requests, paying per-token at official rates through OpenRouter — with no subscription overhead — can come out ahead for the right usage pattern.
- You need fine-grained cost visibility. OpenRouter gives you per-request cost data, which is useful for charging back to projects or clients.
Crucially, OpenRouter doesn't undercut Copilot's per-token pricing in any meaningful way — they're sourcing from the same providers. The savings come from removing the subscription and from optimizing prompt/cache usage yourself.
A Realistic Cost Scenario
Let's model a heavy coding month for one developer:
Assumptions:
- 20 working days
- 50 agent-based coding sessions (refactors, debugging, feature builds)
- Average 30,000 input tokens + 10,000 output tokens per session
- Model: Claude Sonnet 4.6
Total token consumption:
- Input: 50 × 30,000 = 1.5M tokens
- Output: 50 × 10,000 = 0.5M tokens
Cost on GitHub Copilot Pro+ (tokens beyond allowance):
- Input: 1.5M × $3.00 = $4.50
- Output: 0.5M × $15.00 = $7.50
- Token cost: $12.00
- Plus subscription: $39.00
- Total: $51.00
Cost on OpenRouter (no subscription):
- Input: 1.5M × ~$3.00 = $4.50
- Output: 0.5M × ~$15.00 = $7.50
- Total: $12.00
Cost on a third-party relay (OpenAI-Hub-style, USD-converted):
- Input: 1.5M × ~$0.42 = $0.63
- Output: 0.5M × ~$2.11 = $1.06
- Total: ~$1.69 — but with the trust, authenticity, and data-handling questions raised above.
The honest read: at this usage level, you're paying roughly $39/month for Copilot's autocomplete, IDE integration, and bundled code review. If you genuinely use those features, that's defensible value. If you don't, OpenRouter delivers the same model quality for the raw token cost.
The Model That Quietly Wins on Value
For pure coding value, Claude Sonnet 4.6 is the best balance of capability and cost. It's priced identically to Sonnet 4.5 ($3.00 input / $15.00 output per 1M tokens) but positioned as a stronger coding and computer-use model.
For most professional coding work — refactoring, debugging, architecture, test generation — Sonnet 4.6 is sufficient. Reserve GPT-5.5 or Opus 4.7 (each roughly 2× the price) for tasks where mistakes are expensive: migrations, security reviews, complex system design.
Practical Recommendation
Don't pick one tool exclusively. The most cost-effective setup combines them:
| Use Case | Best Tool |
|---|---|
| Daily autocomplete in the IDE | Copilot Pro or Pro+ |
| Quick inline chat / small edits | Copilot (GPT-5.4 mini or Sonnet 4.6) |
| Agent-based coding outside the IDE | OpenRouter with Sonnet 4.6 |
| Hard architecture / debugging | GPT-5.5 or Opus 4.7, selectively |
| PR code review | Copilot (bundled) |
| Cost-optimized heavy bulk usage | OpenRouter at official rates |
| Production code, customer data | Avoid unverified relays |
What You Can Do Next
- Measure your actual token consumption for the next 30 days. GitHub will start showing per-token usage from June 1, 2026. Without that number, every cost decision is a guess.
- Test OpenRouter for one workflow. Set up Cursor or Continue with OpenRouter, run a typical week's worth of agent tasks, and compare the line-item cost against Copilot's token rate.
- Default to Sonnet 4.6. It's the best cost-per-capability model in the current lineup. Escalate to GPT-5.5 or Opus 4.7 only when the task genuinely needs it.
- Keep Copilot Pro for autocomplete even if you move agent work elsewhere. $10/month for unlimited inline completions is hard to beat.
- Reevaluate after June 1, 2026. Once token-based billing is live, compare a full month of measured usage against your subscription cost — and decide deliberately, not by habit.
The Real Tradeoff
GitHub Copilot is a bundled product. You're paying for integration, reliability, and convenience. OpenRouter is unbundled — you pay only for tokens at near-official rates, and you carry the configuration burden. Third-party relays are something else entirely: dramatically cheaper, but with material questions about authenticity, data handling, and continuity.
The decision isn't "which is cheaper?" It's "which cost structure matches how I actually code — and which trust boundary am I willing to operate inside?"
Sources:
- GitHub Copilot Models and Pricing — verified May 24, 2026
- GitHub Copilot Plans
- Anthropic Claude Sonnet 4.6 Announcement
- Third-party relay (OpenAI-Hub) public pricing screenshots, May 2026
Get practical posts on enterprise AI and transformation. Only useful updates, sent as a weekly digest.
One practical digest each week. Unsubscribe anytime.





