The Problem
AI models have a knowledge cutoff. Anything after that date is simply unknown. For content within their training data, they reconstruct answers from statistical patterns—details can be wrong or conflated. They can’t cite a primary source because they don’t have one. And they have no access to your organization’s internal documents at all.
The result: plausible-sounding but subtly wrong answers—which is worse than “I don’t know.”
Honcho solves this by giving AI the actual source material. Ask a question, Honcho retrieves the real documents, and you get grounded answers with citations back to the originals.
- Recency
- Precision
- Provenance
The Short Version
Honcho is a content intelligence platform that indexes large collections of documents—research archives, institutional publications, news corpora, internal knowledge bases—and makes them searchable and accessible to AI assistants. Import tens of thousands of articles from a CMS, bulk-load a document archive, or crawl hundreds of feeds over time. Everything lands in the same full-text index.
What makes it powerful is what happens next. Connect Honcho to Claude or another AI assistant, and the entire corpus becomes conversational. Ask “how have expert views on China’s economy evolved over the past decade?” and the AI synthesizes across thousands of indexed documents—threading together analysis, tracking how positions shifted, surfacing connections that no keyword search would find. Combine results from your library with live web content to compare, verify, and fill gaps.
Content flows in from many directions: bulk importers for CMS migrations and document archives, an extract API with declarative rules for structured data sources, pull replication from WordPress and other CMS platforms, RSS/Atom crawling for ongoing feeds, pluggable connectors for custom sources, and AI assistants that save articles and notes on your behalf.
It supports multiple users and groups, so a team can share a common pool of content and collaborate through their AI assistants. An analyst can summarize a report and share it to the team’s feed. A writer can research across the entire corpus and send findings to colleagues. A shared knowledge base builds organically without anyone copying and pasting links around.
Honcho is also a persistence layer for AI workflows. Agent apps that summarize articles, monitor topics, or produce analysis can write their output back to Honcho, where it becomes searchable alongside everything else. The content you curate and the content your tools produce all live in one place.
Cross-topic synthesis across years of financial commentary. Briefings assembled from dozens of sources in minutes. Expert opinion tracked over time. Pattern recognition across industries.
See real use cases →What It Does
Most content platforms are assembled from separate services—a CMS for storage, Elasticsearch for search, a feed reader for ingestion, S3 for assets, a custom API layer to glue it all together. Honcho replaces that stack with a single integrated system deployed on your infrastructure.
- Aggregate
- Ingest
- Index
- Store
- Serve
- Replicate
| Concern | Typical Stack | Honcho |
|---|---|---|
| Storage | PostgreSQL / DynamoDB | Built-in |
| Search | Algolia / Elasticsearch | Built-in |
| Crawling | Scrapy / custom crawlers | Built-in |
| Ingestion | Custom importers / ETL | Built-in (MCP, save URL, extract API, bulk import) |
| Assets | S3 / cloud storage | Built-in |
| API | Custom REST / GraphQL | Built-in (GraphQL + REST + MCP) |
| Sync | Custom ETL / webhooks | Built-in |
What Makes It Different
- Search is built in, not bolted on
- Structured content fragments
- Declarative content extraction
- AI-powered search & summarization
- One system, not six
Architecture
Built on standard enterprise infrastructure that any Java team can deploy and maintain. No exotic dependencies, no cloud-specific lock-in, no operational surprises.
- Runtime
- Search engine
- API layer
- Data model
- Instrumentation
- Text analysis
- Custom fields
Built-in MCP Server
Honcho includes a built-in Model Context Protocol server,
so AI assistants like Claude can search, retrieve, and create content directly. Tools cover
search, content management, collaboration, memory, and personalized digest. Point any MCP client at the
/mcp endpoint and the entire index becomes conversational.
See practical use cases →
| Tool | What It Does |
|---|---|
search_content |
Full-text search with host, author, date range, tag, topic, type, and sort filters. Natural language queries are automatically enhanced to Lucene syntax when LLM integration is enabled. |
get_entry |
Retrieve a single entry by UID or most recent |
get_entries |
Retrieve multiple entries by UID in a single call (max 25) |
find_similar_entries |
Find content similar to a given entry |
list_hosts |
List content sources, optionally filtered |
term_frequency |
Term frequency statistics for any indexed field |
create_entry |
Create a new entry with title, content, tags, topics, type, author, and metadata |
update_entry |
Update fields on an existing entry—replace or append tags/topics, merge metadata |
delete_entry |
Soft-delete an entry from the database and search index |
tag_entries |
Add or remove tags from entries matching a search query |
list_groups |
List your collaborative groups with members and shared feed status |
discover_feeds |
Discover RSS/Atom feeds from any URL |
add_source |
Add a content source and enable crawling |
add_source_to_group |
Share an existing source with a group so all members can access it |
remove_source_from_group |
Remove a source from a group |
send_message |
Send a message to a group’s shared feed |
share_to_group |
Share an existing entry to a group feed, preserving original author |
get_status |
Account overview—sources, entries, groups, and favorites |
save_memory |
Save a note that can be recalled in future conversations |
recall_memory |
Search or list saved memories |
get_digest |
Get a personalized feed from favorited authors, sources, hosts, and saved searches |
list_favorites |
List favorites with enabled/disabled status |
add_favorite |
Follow an author, source, host, or search query |
remove_favorite |
Remove a favorite from the personalized digest |
- Read and write
- Structured retrieval
- Multi-source aggregation
- Real-time content
- Group collaboration
- AI memory
- Messaging
- Cross-posting
- Personalized digest
- Feed management
- Multi-user isolation
Use Cases
- Research & analysis
- Team knowledge base
- Content aggregation and monitoring
- CMS integration & replication
- Headless search API
- Content archiving
Status
Honcho is actively developed and available for licensing, collaboration, or investment.