GitHub Trends

#python #ai #copilot #development #engineering #prd #spec #spec_driven

# Spec Kit: Build Software Faster with Specifications

Spec Kit is an open-source toolkit that helps you create high-quality software quickly by focusing on what you want to build rather than writing code from scratch. Instead of starting with code, you write clear specifications describing your goals, then the system generates a technical plan and breaks it into tasks that an AI coding agent executes. This approach—called Spec-Driven Development—flips traditional software development by making specifications executable and central to the process. You benefit by reducing rework, maintaining consistency across your project, and letting AI handle repetitive coding while you focus on the important decisions about what to build and why.

https://github.com/github/spec-kit

GitHub

GitHub - github/spec-kit: 💫 Toolkit to help you get started with Spec-Driven Development

💫 Toolkit to help you get started with Spec-Driven Development - github/spec-kit

👏1

461 views12:30

GitHub Trends

#swift #cpp #csharp #go #ios #java #lightweight #nodejs #on_device #python #rust #swift #text_to_speech #tts #web

Supertonic is a fast, lightweight text-to-speech system that runs directly on your device without needing the internet or cloud services. It supports 31 languages and works across phones, computers, browsers, and other platforms. The system is small enough to run on devices like Raspberry Pi while staying accurate and quick. You get complete privacy since everything happens locally on your device, and you can use it for free with no network dependency. It handles complex text like phone numbers and currency amounts better than many larger systems.

https://github.com/supertone-inc/supertonic

GitHub

GitHub - supertone-inc/supertonic: Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.

Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX. - supertone-inc/supertonic

526 views13:30

GitHub Trends

#python #agents #llm #rag #skills #video_analytics #video_search #vlm

NVIDIA's Video Search and Summarization blueprint lets you build AI agents for fast video analysis, including real-time intelligence, alerts, natural language search, Q&A, and long-video summaries using vision models and NIM microservices. Deploy easily via Docker or cloud notebooks on supported NVIDIA hardware. This saves you time analyzing huge video volumes for smart monitoring, warehouses, or operations, boosting decisions and efficiency with ready workflows and custom options.

https://github.com/NVIDIA-AI-Blueprints/video-search-and-summarization

GitHub

GitHub - NVIDIA-AI-Blueprints/video-search-and-summarization: Suite of reference architectures for building GPU-accelerated vision…

Suite of reference architectures for building GPU-accelerated vision agents and AI-powered video analytics applications. - NVIDIA-AI-Blueprints/video-search-and-summarization

❤1

524 views12:00

GitHub Trends

#python #automation #claude #mcp #notebooklm #skill

This tool turns many kinds of content, like web pages, PDFs, podcasts, videos, and files, into formats like podcasts, PPTs, mind maps, quizzes, and reports with NotebookLM. It can also handle many sources automatically, including public pages, social posts, and some paywalled articles, and it supports Chinese and English well. The benefit to you is simple: you can save time, understand content faster, and reuse the same material in the format you need for study, work, or sharing.

https://github.com/joeseesun/qiaomu-anything-to-notebooklm

GitHub

GitHub - joeseesun/qiaomu-anything-to-notebooklm: Claude Skill: Multi-source content processor for NotebookLM. Supports WeChat…

Claude Skill: Multi-source content processor for NotebookLM. Supports WeChat articles, web pages, YouTube, PDF, Markdown, search queries → Podcast/PPT/MindMap/Quiz etc. - joeseesun/qiaomu-anything-...

609 views11:30

GitHub Trends

#jupyter_notebook #agent #agent_framework #agents #ai_agents #deployment #genai #generative_ai #langgraph #llm #llms #mlops #production #python #tutorials

Agents Towards Production is a free open-source guide for building AI agents that work in real products. It gives runnable tutorials on memory, tools, search, deployment, security, monitoring, testing, and user interfaces. You can use it to learn faster, build with less guesswork, and move from a simple prototype to a more reliable, scalable agent system.

https://github.com/NirDiamant/agents-towards-production

GitHub

GitHub - NirDiamant/agents-towards-production: End-to-end, code-first tutorials for building production-grade GenAI agents. From…

End-to-end, code-first tutorials for building production-grade GenAI agents. From prototype to enterprise deployment. - NirDiamant/agents-towards-production

414 views11:30

GitHub Trends

#python #ai #ai_agents #conversational_ai #fastapi #llm #nextjs #open_source #outbound_calls #pipecat #python #self_hosted #speech_to_text #telephony #text_to_speech #voice #voice_agents #voice_ai #voice_assistant #voip #webrtc

Dograh AI is an open-source, self-hostable tool for building voice agents with a drag-and-drop workflow. You can start fast, run it on your own server, use your own LLM, TTS, and STT services, and avoid vendor lock-in. The benefit to you is more control, more privacy, and a working voice bot in minutes without needing API keys.

https://github.com/dograh-hq/dograh

GitHub

GitHub - dograh-hq/dograh: Open Source Voice Agent Platform

Open Source Voice Agent Platform. Contribute to dograh-hq/dograh development by creating an account on GitHub.

415 views12:00

GitHub Trends

#python #ai_agents #amd #comfyui #docker #llama_cpp #llm #local_ai #n8n #nvidia #open_webui #rag #self_hosted #speech_to_text #strix_halo #text_to_speech #workflow_automation

Dream Server lets you run AI on your own machine instead of renting it from a cloud service. It works on Linux, Windows, and macOS, and it can set up chat, voice, agents, search, image tools, and privacy tools with one command. The main benefit is more control: your data stays with you, costs can be lower, and you can keep using AI even without a cloud account.

https://github.com/Light-Heart-Labs/DreamServer

GitHub

GitHub - Light-Heart-Labs/DreamServer: Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG…

Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions. - Light-Heart-Labs/DreamServer

❤1

484 views13:00

GitHub Trends

#python

CLI-Anything turns software into agent-ready command line tools, so AI agents can use real apps through simple text commands instead of fragile GUI steps. It supports one-command setup, a 7-step build process, JSON output, REPL use, and a CLI-Hub for browsing and installing tools. The benefit to you is faster automation, less setup work, and more reliable control of software like Blender, GIMP, LibreOffice, and many others.

https://github.com/HKUDS/CLI-Anything

GitHub

GitHub - HKUDS/CLI-Anything: "CLI-Anything: Making ALL Software Agent-Native" -- CLI-Hub: https://clianything.cc/

"CLI-Anything: Making ALL Software Agent-Native" -- CLI-Hub: https://clianything.cc/ - HKUDS/CLI-Anything

❤1

514 views13:30

GitHub Trends

#python #agentic_aigc #video_generation

ViMax is an AI tool that can turn an idea, story, novel, or script into a video by itself. It helps with writing the script, planning scenes, choosing reference images, keeping characters consistent, and making the final video. This saves time and makes video creation easier for you, especially if you want to make videos without doing all the technical work.

https://github.com/HKUDS/ViMax

GitHub

GitHub - HKUDS/ViMax: "ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"

"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)" - HKUDS/ViMax

👍1

378 views12:30

GitHub Trends

#python #agents #ai #ai_agents #ai_engineering #computer_vision #course #deep_learning #from_scratch #generative_ai #llm #machine_learning #mcp #nlp #python #reinforcement_learning #rust #swarm_intelligence #transformers #tutorial #typescript

This is a free MIT learning guide for AI engineering with 428 lessons in 20 phases. It teaches you AI from the math up, then moves into machine learning, deep learning, LLMs, agents, tools, safety, and production. Each lesson helps you build useful code or AI tools, not just read theory. You can start at the right level, follow a clear path, and keep reusable artifacts for real work. The benefit is simple: you learn how AI actually works and gain practical skills you can use to build and ship better AI systems.

https://github.com/rohitg00/ai-engineering-from-scratch

GitHub

GitHub - rohitg00/ai-engineering-from-scratch: Learn it. Build it. Ship it for others.

Learn it. Build it. Ship it for others. Contribute to rohitg00/ai-engineering-from-scratch development by creating an account on GitHub.

👍1

333 views14:00

GitHub Trends

#python #agentic_ai #agentic_workflow #agents #function_calling #llama_cpp #llamafile #llm #ollama #python #self_hosted #tool_calling

Forge is a Python tool that makes self-hosted LLM tool-calling more reliable. It helps local models handle multi-step tasks with guardrails, better context control, and support for Ollama, llama-server, Llamafile, and Anthropic. You can use it as a workflow runner, middleware, or proxy server with OpenAI-style clients. The benefit is fewer broken tool calls, better results on small models, and easier setup for agent apps, chat tools, and long-running sessions.

https://github.com/antoinezambelli/forge

GitHub

GitHub - antoinezambelli/forge: A Python framework for self-hosted LLM tool-calling and multi-step agentic workflows

A Python framework for self-hosted LLM tool-calling and multi-step agentic workflows - antoinezambelli/forge

175 views12:30

About

Blog

Apps

Platform