Home

If you find any broken links in any posts or have any questions, feel free to send an email me[@]manta.black.

Recent

Harness Engineering Thoughts and Notes

Given the influx of LLM agent tooling, blog posts, guides, research, and explosion of takes, I wanted to capture my perspective of what I have been reading, seeing, and some experiments that I have been dabbling in or on. I do not proclaim to be an expert in the space, and I am writing this by hand to stretch my mind, and I hope that what I compile and share becomes valuable to some person or agent.
·3 mins

Simeon

Trading Learned Embeddings for Determinism and Speed in YAMS # YAMS started with a goal, experiment with a local memory system that was cli first, that reduced the amount of tokens it took models to find materials for a task. Then it evolved into a learning endeavor as I attempted to ship a working hybrid search engine that is fast enough to run locally, predictable enough to tune, and work generally with how I used models. To get there quickly, we initially leaned on a conventional learned-embedding stack via ONNX Runtime for embedding generation and re-ranking. It worked, and it allowed us to prove out ingestion, storage, and retrieval end-to-end with a simple cli and mcp scheme.
·4 mins

What Have I Been Up To?

It’s been about seven months since I last posted here (about YAMS), so I wanted to share a quick recap of what I’ve been up to. If you measure that gap in AI progress, we’ve seen an incredible acceleration in both performance and the pace of advancement. In the time that I’ve been absent, I’ve been working on pushing YAMS forward. I’ve learned a lot of hard-won lessons about building a memory system for AI in C++. In terms of where I want to be, I’m about 75% of the way to getting the project to a place where it’s both usable and beneficial for others.
·2 mins

Yams

YAMS (Yet Another Memory System) started as a practical need: I wanted a dead-simple way to store and retrieve files, snippets, and research for LLM-driven workflows — without losing context between sessions. What I use daily is now ready to share. Note: YAMS is v0.7.x - experimental software under active development, not yet production-ready. Updated docs (2025-10-13): Site: https://yamsmemory.ai CLI: https://yamsmemory.ai/user_guide/cli/ MCP server: https://yamsmemory.ai/user_guide/mcp/ Deployment: https://yamsmemory.ai/operations/deployment/ Why YAMS? # Persistent memory for LLMs and tools Content-addressed storage (SHA-256) with block-level deduplication (Rabin) Compression (zstd/LZMA), crash safety (WAL) Fast search: SQLite FTS5 and semantic vector search Simple CLI and TUI; MCP server for Claude/Desktop and other MCP clients Now with alpha plugin support Quick install # Docker:
·6 mins

Following Up On Entity Extraction In Umbrix

So I wanted to post a follow-up to my previous introduction to the umbrix platform, where I was using DSPy for entity extraction in cyber threat intelligence. If you missed that, you can find it here . This recap comes a week after I launched the platform. Two days into getting Umbrix running optimally, I started running into many problems. I have been extracting a significant volume of entities, relationships, and nodes from my agents to populate in the graph. However, I quickly realized my system was burning through resources - both in terms of rate limits and agent costs.
·5 mins