Skip to content

Moss Client SDKs

Sub-10 ms retrieval for conversational AI, voice assistants, and multimodal agents. On-device, in-browser, or in the cloud.

Why Moss

Moss is a high-performance runtime for real-time semantic search. It delivers sub-10 ms retrieval, instant index updates, and zero infrastructure overhead. It runs wherever your intelligence lives — in-browser, on-device, or in the cloud — so search feels native and effortless.

TIP

Head to Moss Portal to set up projects and start building with sub-10ms search.

Common use cases

  • Copilot memory — Recall user context instantly, even offline
  • Docs search — Fast, private search inside help centers and knowledge bases
  • Desktop productivity — Smart search in note apps or IDEs without sending data online
  • AI-native apps — Sub-10ms search on phones and AI-PCs with no lag even on bad networks

Quick start

bash
npm install @inferedge/moss
bash
pip install inferedge-moss
ts
import { MossClient } from '@inferedge/moss'

const client = new MossClient(process.env.PROJECT_ID!, process.env.PROJECT_KEY!)
await client.createIndex('docs', [{ id: '1', text: 'Vector search in production' }])

await client.loadIndex('docs')
const results = await client.query('docs', 'production search tips')
py
from inferedge_moss import MossClient, DocumentInfo

client = MossClient("$PROJECT_ID", "$PROJECT_KEY")
await client.create_index(
    "docs",
    [DocumentInfo(id="1", text="Vector search in production")],
)

await client.load_index("docs")
results = await client.query("docs", "production search tips")

Next steps

TaskLink
Project setup and credentialsGetting Started
JavaScript usage and API docsJavaScript SDK
Python usage and API docsPython SDK

Contact

For support, commercial licensing, or partnership inquiries: contact@moss.dev