MCP Server Performance: What 39.9 Million Requests Say About Language Choice
After reading TM Dev Lab's benchmark across 15 implementations, I think defaulting to Python for MCP servers is a mistake for production. Java and Go are in a different tier.
Everyone building AI agents defaults to Python for MCP servers. After working through the data — 39.9 million requests across 15 implementations — I don't think that default survives contact with production load.
What They Tested
Streamable HTTP transport (the current MCP standard), k6 load testing, 50 concurrent virtual users, Docker containers capped at 1 CPU and 1GB of memory. Four workload types: Fibonacci (CPU-bound), external fetch (I/O-bound), JSON transformation, and database simulation. Three independent test rounds. The four core implementations:
| Language | Avg latency | RPS | Memory | CPU under load |
|---|---|---|---|---|
| Java (Spring Boot) | 0.835ms | 1,624 | 226MB | 30% |
| Go | 0.855ms | 1,624 | 18MB | 28% |
| Node.js | 10.66ms | 559 | 110MB | 93% |
| Python (FastMCP) | 26.45ms | 292 | 98MB | 99% |
Zero errors across all requests. All four are reliable. The question is performance and what it costs you.
Two Tiers, Not Four
Java and Go are in one category. Node.js and Python are in another.
Java and Go both hit 1,624 RPS at sub-millisecond latency. They ran at 28–30% CPU under full load — headroom to scale. Go's throughput variability across three rounds was 0.5%, Java's was 0.7%. Tight.
Node.js peaked at 559 RPS and ran at 93% CPU. Python managed 292 RPS at 99% CPU. Both were saturated. No headroom. Python's variability was 9%, with an 8% throughput drop in round 2.
The gap between tiers isn't incremental. It's roughly 3x (Node.js) to 5.6x (Python) on RPS, and 13x to 32x on latency.
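Those multipliers fall straight out of the table. A quick sanity check from the rounded figures above (my arithmetic, not the benchmark's):

```python
# Tier gap, computed from the rounded table figures.
java_go_rps, node_rps, python_rps = 1624, 559, 292
java_lat, node_lat, python_lat = 0.835, 10.66, 26.45

print(f"RPS gap vs Node.js:  {java_go_rps / node_rps:.1f}x")    # 2.9x
print(f"RPS gap vs Python:   {java_go_rps / python_rps:.1f}x")  # 5.6x
print(f"Latency gap, Node:   {node_lat / java_lat:.1f}x")       # 12.8x
print(f"Latency gap, Python: {python_lat / java_lat:.1f}x")     # 31.7x
```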
The Go Case
Go and Java are statistically tied on throughput and latency. 0.835ms vs 0.855ms is noise. Where Go wins is memory: 18MB vs 226MB. That's 12.8x better memory efficiency at equivalent performance. In Kubernetes, more replicas per node. In cost-sensitive deployments, real money at scale. The benchmark measured 92.6 RPS/MB for Go vs 7.2 RPS/MB for Java.
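Recomputing the efficiency figure from the table's rounded values lands close to the benchmark's published numbers (92.6 and 7.2 RPS/MB, which use unrounded measurements):

```python
# Memory efficiency from the rounded table figures; the benchmark's
# published 92.6 RPS/MB for Go comes from unrounded measurements.
go_rps, go_mem = 1624, 18
java_rps, java_mem = 1624, 226

go_eff = go_rps / go_mem        # ~90.2 RPS/MB
java_eff = java_rps / java_mem  # ~7.2 RPS/MB
print(f"Go:    {go_eff:.1f} RPS/MB")
print(f"Java:  {java_eff:.1f} RPS/MB")
print(f"Ratio: {go_eff / java_eff:.1f}x")  # ~12.6x from rounded inputs
```

At equal throughput the ratio reduces to the memory ratio, which is why the efficiency gap tracks the 18MB-vs-226MB footprint so closely.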
I build AI agents with Spring AI and Spring Boot. The Java numbers are solid and I'm not switching stacks. But for a new MCP service with no existing Java infrastructure, the Go argument is hard to dismiss.
Python's Actual Numbers
I expected Python to be slower. I didn't expect 84x.
CPU-bound Fibonacci: Java at 0.37ms, Go at 0.39ms, Python at 30.83ms. I/O fetch: Go at 1.29ms, Python at 80.92ms — 61x slower. The CPU gap is the GIL. Python's Global Interpreter Lock means even threaded Python runs one thread at a time. FastMCP on single-worker uvicorn saturated at 99% CPU while delivering 292 RPS. Java and Go handled 1,624 RPS at 30% CPU.
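The GIL effect is easy to reproduce outside the benchmark. A minimal stdlib-only sketch (not the benchmark's harness; thread count and `n` are arbitrary) shows CPU-bound threads gaining nothing from concurrency:

```python
# Minimal GIL demonstration: CPU-bound work gets concurrency from
# threads, but not parallelism — four threads doing 4x the work take
# roughly 4x the wall-clock time.
import threading
import time

def fib(n: int) -> int:
    # Deliberately naive recursive Fibonacci, pure CPU work.
    return n if n < 2 else fib(n - 1) + fib(n - 2)

def timed(workers: int, n: int = 27) -> float:
    # Run `workers` threads, each computing fib(n); return elapsed seconds.
    threads = [threading.Thread(target=fib, args=(n,)) for _ in range(workers)]
    start = time.perf_counter()
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return time.perf_counter() - start

print(f"1 thread:  {timed(1):.2f}s")
print(f"4 threads: {timed(4):.2f}s")  # no speedup: the GIL serializes them
```

In Go or Java, the four workers would run on four cores and finish in roughly the same wall-clock time as one.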
Multi-worker uvicorn and uvloop help. But you're still fighting the GIL on CPU-bound work. And most people don't tune their MCP server setup past the default config.
What This Means Practically
For prototyping and internal tools: Python is fine. Iteration speed beats runtime performance when you're testing whether an MCP tool is worth building.
For production at real load: Python and Node.js will hit their ceiling. Node.js at 93% CPU saturation has no room for traffic spikes. Python at 26ms average adds latency to every tool call your agent makes.
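That per-call latency compounds across an agent loop. A back-of-envelope sketch, using the table's averages and a hypothetical count of ten sequential tool calls per task:

```python
# Latency tax per agent task (hypothetical call count; latencies
# are the benchmark averages from the table above).
python_ms, go_ms = 26.45, 0.855
calls = 10  # sequential MCP tool calls in one agent task

print(f"Python adds {python_ms * calls:.1f}ms per task")  # 264.5ms
print(f"Go adds     {go_ms * calls:.2f}ms per task")      # 8.55ms
```

A quarter second of pure transport overhead per task is invisible in a demo and very visible in a multi-step agent under load.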
Go is the rational production choice without existing Java infrastructure. Same performance, a fraction of the memory, best consistency numbers in the test. The tradeoff: smaller AI/ML library ecosystem than Python.
Java makes sense if you already run Java services. Spring AI integration is solid, the JVM handles the load well, and 226MB is a fine tradeoff when Spring AI is doing the heavy lifting.
Sources: TM Dev Lab Benchmark v2 · v1 baseline · GitHub: benchmark-mcp-servers