TypeScript SDK for different in-browser AI model providers, built to make client-side AI integration simpler and more consistent across vendors.
-
Updated
Jun 27, 2026 - TypeScript
TypeScript SDK for different in-browser AI model providers, built to make client-side AI integration simpler and more consistent across vendors.
Your users download a 4GB AI model, the connection drops at 3.8GB, verifyfetch resumes from 3.8GB and verifies every byte. Drop-in integrity verification for Transformers.js, WebLLM, and any large file in the browser.
A lightweight recreation of OS1/Samantha from the movie Her, running locally in the browser
A Webapp that uses Retrieval Augmented Generation (RAG) and Large Language Models to interact with a PDF directly in the browser.
Hybrid semantic search and AI curation for your vault. Combines BM25 keyword, on-device WebGPU embeddings, and fuzzy title matching. Multilingual, with particularly strong Chinese/CJK support. Opt-in AI features auto-generate note descriptions and topic-grouped Maps of Content. Local-first, no API keys.
interactive semantic search demo using Qwen3-0.6B-Embedding in your browser
The web application for Texo, a minimalist SOTA LaTeX OCR model which contains only 20M parameters runs in browser. | 超轻量SOTA LaTeX公式识别模型,20M参数量的 web 应用仓库
A local-first Chrome extension that passively captures ChatGPT, Gemini, Claude, Grok, Perplexity conversations into a private memory graph. Features in-browser Hybrid RAG (Vector + BM25), semantic search, and 100% privacy via WebAssembly and IndexedDB. No servers, no API keys.
Free deep-learning monitoring PWA for pets, babies, and elderly care. Real-time computer vision (TensorFlow.js COCO-SSD + Transformers.js ViT) runs entirely on-device, fully offline. Privacy-first.
100% client-side entity-based topic clustering tool. ML models run in your browser — extract entities, resolve to Wikidata, visualize as knowledge graph.
An MCP-enabled Qwen3 0.6B demo with adjustable thinking budget, all in your browser!
TypeScript implementation of Google's TurboQuant algorithm for near-optimal vector quantization. Zero dependencies, works in Node.js and browsers.
Private, on-device AI meeting notetaker that runs entirely in your browser — live transcription, speaker ID, and note extraction with no audio leaving your machine (WebGPU + WASM).
🚀🛸 Easily boost the speed of pulling your models and datasets from various of inference runtimes. (e.g. 🤗 HuggingFace, 🐫 Ollama, vLLM, and more!)
LFM2.5 1.2 billion reasoning language model runs on web browsers using Transformers.js and ONNX Runtime Web.
A Chrome extension that uses PromptAPI and vector embeddings to help users quickly find answers about the current website's content.
Boot a real x86 Alpine Linux VM in your browser (v86), attach xterm consoles, and let a local AI agent run tools inside the guest—Transformers.js/Ollama, static SPA, no cloud VM required.
Always-on companion for Claude that remembers your decisions and their evolution. Local-first memory using SQLite + transformers.js embeddings.
Give your AI a perfect, infinite memory. A local-first, zero-LLM memory system and Model Context Protocol (MCP) server designed to give AI assistants (like Claude, ChatGPT, and custom agents) a searchable, structured "Memory Palace."
Add a description, image, and links to the transformers-js topic page so that developers can more easily learn about it.
To associate your repository with the transformers-js topic, visit your repo's landing page and select "manage topics."