Java-Native AI Gateway for Claude, OpenAI, Llama, and More

Java teams shouldn’t need Python to build AI.

For the last 18 months, Java teams have been duct-taping LLM SDKs into their stack, so I decided to build what Java teams actually need.

Meet Nautilus: a Java-native AI Gateway, now open source on GitHub. One API across Claude, OpenAI, Llama, Mistral, and more. Built with Spring Boot 3 and Java 21. No Python sidecars. No reverse-proxy hacks. No “let’s switch stacks for this one service.”

What’s inside:
→ Smart routing across providers (priority, round-robin, random, cost-aware)
→ Automatic fallback on rate limits, timeouts, and 5xx errors
→ Semantic caching (pgvector + Redis hot tier): same prompt, no second bill
→ Per-key rate limiting + cost tracking in tokens and USD
→ Cross-cutting concerns done right: log redaction, audit trails, validation
→ Observability you already use: Micrometer, Prometheus, OpenTelemetry

We’re one week away from v0.1. The core foundation is ready: provider SPI, routing, fallback, and an OpenAI-compatible API.

Next up:
→ Native Claude + OpenAI adapters
→ Streaming support
→ Spring Boot starter

If you're building on the JVM and waiting for first-class AI infrastructure in Java, this is for you.

⭐ Stars and early feedback will help us shape what ships first.

Repo link in the first comment ↓

#Java #SpringBoot #AI #LLM #OpenSource #DeveloperTools #Github
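
To make the routing and fallback idea concrete, here is a minimal sketch of priority routing with automatic fallback on rate limits, timeouts, and 5xx errors. It is only an illustration, not Nautilus code: the names `Provider`, `ProviderException`, `route`, and `Stub` are hypothetical, and using status 0 to stand for a timeout is an assumption made for this example.

```java
import java.util.List;

public class FallbackRouterSketch {

    /** Hypothetical provider abstraction; not the actual Nautilus SPI. */
    interface Provider {
        String name();
        String complete(String prompt);
    }

    /** Failure carrying an HTTP-like status; 0 stands for a timeout (assumption). */
    static class ProviderException extends RuntimeException {
        final int status;

        ProviderException(int status, String message) {
            super(message);
            this.status = status;
        }

        /** Rate limits (429), timeouts (0), and 5xx errors are worth a fallback. */
        boolean retryable() {
            return status == 0 || status == 429 || status >= 500;
        }
    }

    /** Tries providers in priority order; falls back only on retryable failures. */
    static String route(List<Provider> byPriority, String prompt) {
        ProviderException last = null;
        for (Provider p : byPriority) {
            try {
                return p.complete(prompt);
            } catch (ProviderException e) {
                if (!e.retryable()) {
                    throw e; // e.g. a 400: another provider would not help
                }
                last = e; // remember the failure and try the next provider
            }
        }
        throw last != null ? last : new ProviderException(503, "no providers configured");
    }

    /** Stand-in provider: fails with failStatus, or succeeds if failStatus is negative. */
    record Stub(String name, int failStatus) implements Provider {
        public String complete(String prompt) {
            if (failStatus >= 0) {
                throw new ProviderException(failStatus, name + " failed");
            }
            return name + ": " + prompt;
        }
    }

    public static void main(String[] args) {
        List<Provider> providers = List.of(new Stub("primary", 429), new Stub("secondary", -1));
        System.out.println(route(providers, "hello")); // prints "secondary: hello"
    }
}
```

In this sketch a rate-limited primary hands the request to the secondary, while a non-retryable error such as a 400 is surfaced immediately instead of being retried elsewhere. Round-robin, random, or cost-aware strategies would only change how the `byPriority` list is ordered before the loop runs.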
