Skip to content
Asia

JP · 8 metros

Tokyo exits for Japanese corpora and Article 30-4 collection

AWS Tokyo throughput and NTT/KDDI/SoftBank residential. Japan has the most permissive TDM regime in the G7 — the SquadProxy JP pool is built around that legal anchor for AI teams scaling Japanese-language collection.

Carrier ASNs we size against

  • NTT Communications (AS2914)
  • KDDI (AS2516)
  • SoftBank (AS4725)
  • AWS ap-northeast-1 Tokyo (AS16509)
  • GCP asia-northeast1 (AS15169)
  • Rakuten Mobile (AS55690)
  • IIJ (AS2497)
  • JPNIC peering
statusmarkets10exit classes5edge rtt<50msuptime99.9%ops24/7

The Japan network, up close

What a Japan session actually looks like

Why Japan gets disproportionate weight in AI collection

Two reasons Japanese coverage matters beyond its share of the open web:

  • Legal clarity for TDM. Japan's Article 30-4 is, as of 2026, the broadest text-and-data-mining carve-out in any major economy. For AI teams based outside Japan but running collection from a Japanese datacenter with a legal basis in Japan, this materially reduces copyright exposure on training-corpus workloads that would be more contested under EU or UK law.
  • SquadProxy scarcity. High-quality Japanese long-form text is under- represented on the commodity corpus side (Common Crawl, CC-News). Active collection from Japanese-resident IPs returns substantially more JP content than collection via US exits, because many Japanese publishers geo-route or quietly geoblock.

What the JP pool routes through

  • Datacenter — AWS ap-northeast-1 (Tokyo), GCP asia-northeast1, IIJ-hosted infrastructure. Tokyo-to-JPNIC peering is one hop.
  • Residential — NTT fibre (the dominant home-broadband ASN), KDDI au Hikari, SoftBank Hikari. City-level routing in Tokyo 23 wards, Osaka, Kyoto, Fukuoka.
  • Mobile — NTT Docomo, KDDI au, SoftBank, Rakuten Mobile 4G/5G.

Tsukuba and the AIST corridor

AIST, RIKEN AIP, and the University of Tsukuba form a research corridor with unusually strong NLP output for the country's size. Collection that targets pre-print servers, institutional repositories, and academic mirrors in Tsukuba benefits from a local exit — some of these sources rate-limit foreign ASNs aggressively.

Good fit

  • Training-corpus collection for Japanese under Article 30-4
  • Evaluation of multilingual models on Japanese knowledge (J-MMLU, JGLUE-style benchmarks)
  • RAG ingestion of Japanese government (e-Gov, Kokkai), academic, and technical literature
  • Competitive intelligence against Japanese-market model APIs

Not a good fit

  • APPI-sensitive personal data without consent or cross-border safeguards
  • Circumvention of publisher opt-outs declared under the Article 30-4 "unreasonable prejudice" exception
  • Anything contacting Japanese minors-focused platforms

ASN depth in the Japanese residential pool

Japanese broadband concentrates around the major telecoms plus specific regional carriers:

  • NTT East (AS4713) and NTT West (AS2914) — dominant residential FTTH via Flet's. The NTT pool covers the largest share of Japanese residential traffic.
  • KDDI (AS2516) — au Hikari fiber + cable, strong urban share.
  • SoftBank (AS17676) — BB fiber + cable, second-largest broadband share after NTT.
  • J:COM (AS9605) — cable TV-based broadband, specific metros.

Mobile: NTT DoCoMo, KDDI au, SoftBank, Rakuten Mobile carry essentially all JP 4G/5G. Rakuten Mobile's CG-NAT footprint is distinct from the big three, useful for mobile evaluation diversity.

Top Japanese metros

  • Tokyo — dominant. Government, business, media (NHK, Asahi, Yomiuri, Nikkei), AWS ap-northeast-1 home region. Default origin for JP-anchored AI workloads.
  • Osaka — Kansai region anchor, business + manufacturing. AWS ap-northeast-3 home region (alternative APAC edge).
  • Nagoya — Chubu region, industrial (Toyota), distinct regional press (Chunichi Shimbun).
  • Fukuoka — Kyushu anchor, growing IT / startup cluster, distinct regional content.
  • Sapporo — Hokkaido anchor, northern Japan regional content.

Japanese legal landscape for AI data collection

  • APPI (Act on the Protection of Personal Information) — Japan's primary privacy framework. Extraterritorial reach for processing concerning Japanese data subjects. Cross-border transfers to jurisdictions without adequacy decision require specific safeguards.
  • Copyright Act Article 30-4 — broad exception permitting computational analysis (explicitly covering ML training) of copyrighted works, UNLESS the use "unreasonably prejudices" the rightsholder's interests. This is one of the most permissive TDM regimes globally, but the "unreasonable prejudice" carve-out has been invoked by some publishers — the legal boundary is evolving.
  • Telecom Business Act + MIC guidance — telecommunications operations involving Japanese subscribers have MIC oversight. SquadProxy operates with a JP-registered intermediary for the residential pool.
  • Japan AI Promotion Basic Act (2025) — the recent statute framing AI development and deployment. Voluntary guidelines at the moment but the compliance framework is tightening through 2026-2027.

Latency and edge routing

JP edge routes through AWS ap-northeast-1 (Tokyo). Latency:

  • OpenAI API (APAC routing): ~15-30ms p50 from Tokyo
  • Anthropic API: ~20-40ms p50
  • Google Gemini (asia-northeast1): ~10-20ms p50

Residential adds 30-120ms; mobile 50-180ms. NTT fiber footprints have the most consistent latency; SoftBank is slightly more variable under peak.

Related

Pricing

Japan proxies — same price, same network

Every plan covers the Japan pool and every other market we operate.

Solo

For individual researchers running evaluation scripts and prototype RAG pipelines.

$149/ month

or $1,430/year (save 20%)

50 GB residential · unlimited datacenter · 200 concurrent sessions

  • Access to all 5 exit classes · 10 focus countries
  • 50 GB residential · unlimited datacenter
  • 5 static ISP IPs · 5 GB 4G mobile
  • 1 seat · 200 concurrent sessions
  • Python + Node SDK + REST API
  • Per-request metering (not time-based)
  • Email support (24h response, business days)
  • Overage: $3/GB residential · $6/GB mobile

Best for

  • Solo researchers
  • Evaluation scripts
  • Prototype RAG

Team

Most popular

For AI startups and mid-size labs splitting capacity between training and evaluation.

$699/ month

or $6,710/year (save 20%)

500 GB residential · unlimited datacenter · 1,000 concurrent sessions

  • Access to all 5 exit classes · 10 focus countries
  • 500 GB residential · unlimited datacenter
  • 25 static ISP IPs · 25 GB 4G mobile
  • 10 seats ($29/mo per extra seat) · 1,000 concurrent sessions
  • City-level geo-routing + ASN targeting
  • 99.9% uptime SLA
  • Priority Slack support (4h response, business hours)
  • Python + Node SDK + REST API + webhooks
  • Overage: $3/GB residential · $6/GB mobile

Best for

  • AI startups
  • Mid-size labs
  • Model eval teams

Lab

For academic labs, eval consortia, and frontier model companies running sustained workloads.

$2,999/ month

or $28,790/year (save 20%)

2 TB residential · unlimited DC · 50 GB 4G + 20 GB 5G · 3,000 concurrent sessions

  • Access to all 5 exit classes · 10 countries on 4 continents
  • 2 TB residential · unlimited datacenter
  • 100 static ISP IPs · 50 GB 4G + 20 GB 5G mobile
  • 50 seats ($19/mo per extra seat) · 3,000 concurrent sessions
  • Dedicated gateway lane (bypasses shared-pool queues on us-east-1 + eu-west-1)
  • 99.95% uptime SLA
  • Dedicated Slack channel (1h response, business hours)
  • Custom BGP prefix on request (additional fees apply)
  • Overage: $2.50/GB residential · $5/GB mobile

Best for

  • Academic labs
  • Large eval consortia
  • Frontier model companies

Enterprise

Custom contracts with dedicated infrastructure, volume pricing, and research-grade SLAs.

Custom pricing

Custom (from 5 TB/mo residential) · unlimited concurrent sessions

  • Volume pricing from 5 TB/mo residential
  • Dedicated BGP prefix + ASN announcement
  • Unlimited concurrent sessions · unlimited seats
  • 99.99% uptime SLA with financial credits
  • Named Technical Account Manager + 24/7 on-call paging
  • Custom AUP, DPA, on-site deployment option
  • Research / academic discount (30–50% off Team or Lab)
  • Annual contract · wire, ACH, USDC/USDT/BTC settlement

Best for

  • Frontier labs
  • Eval consortia
  • Enterprise AI

All plans include 14-day refund, single endpoint with regional failover, HTTP(S) + SOCKS5 on every exit class, access to all 5 exit classes and all 10 focus countries, and Python + Node SDKs. Concurrent sessions = simultaneous TCP sessions through the gateway. Overage warnings fire at 80% and 100%; traffic continues only if overage billing is enabled on your account.

FAQ

Japan proxies FAQ

  • Which ASNs are in your Japan residential pool?
    Our Japan pool is sized against NTT Communications (AS2914), KDDI (AS2516), SoftBank (AS4725), with secondary coverage on the remaining carriers we list. ASN targeting is available on Team and up.
  • Can I target specific cities in Japan?
    Yes — city-level targeting is available via the X-Squad-City header, with a pool size appropriate to each city. Our largest Japan pools are in Tokyo, Osaka, Kyoto.
  • What's the legal footprint of using your Japan proxies?
    Japan has the most permissive training-data legal environment in the G7. Article 30-4 of the Copyright Act (revised 2018) explicitly permits text and data mining of copyrighted works for non-enjoyment purposes, which has been interpreted by the Agency for Cultural Affairs (ACA) to cover AI training. The APPI governs personal data; cross-border transfer requires either equivalent-country status, consent, or equivalent safeguards. There is no general public-figure carve-out in the way US law treats it.
  • What latency should I expect to Japan targets?
    Regional edge latency to major Japan metros runs 40–90ms on ISP and 150–300ms on rotating residential. Mobile 4G/5G lands around 80–200ms depending on carrier and time of day.

Ready to route through Japan?

Self-serve checkout delivers your JP credentials in under a minute.