Skip to content

pptcraft

ppt-craft is an unofficial, local “Claude-for-PowerPoint” add-in designed for the GB10 architecture that runs directly inside PowerPoint (online or desktop) 1. It replaces external API dependencies by leveraging a local Qwen3.6 vLLM backend to orchestrate AI-driven generation of presentation decks. The system enables users to interact with a Claude-style taskpane for chat and slide management while the backend handles the complex rendering and OOXML manipulation.

The architecture relies on a shared vLLM endpoint to manage the heavy computational load of the 35B parameter model, which requires approximately 35 GB of VRAM and a 3-minute cold-load time. By sharing this instance across multiple clients, ppt-craft optimizes resource usage while providing a REST API interface that connects the Office.js add-in to the Python-pptx and lxml-based rendering engine.

diagram
Subsystem Description
UI Clients The PowerPoint host environment running the Office.js taskpane SPA for chat and slide tree interaction.
REST API The FastAPI server running on port 3030 that handles requests and manages the connection between the UI and the backend.
Local LLM The shared vLLM endpoint hosting the Qwen3.6-FP8 model, managed by autosre and accessible at localhost:8010.
Rendering Engine The core logic using python-pptx and lxml to unpack, edit, and validate OOXML structures for slide generation.