lucidrains/deep-daze
Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network)
Language: Python
#artificial_intelligence #deep_learning #implicit_neural_representation #multi_modality #siren #text_to_image #transformers
Stars: 127 Issues: 5 Forks: 11
https://github.com/lucidrains/deep-daze
Technique was originally created by https://twitter.com/advadno...
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Language: Python
#chinese #computer_vision #multi_modal_learning #nlp #pytorch #vision_and_language_pre_training
Stars: 80 Issues: 0 Forks: 7
https://github.com/OFA-Sys/Chinese-CLIP
MasterBin-IIAU/Unicorn
[ECCV'22 Oral] Towards Grand Unification of Object Tracking
Language: Python
#multi_object_tracking_segmentation #multiple_object_tracking #object_tracking #single_object_tracking #video_object_segmentation
Stars: 132 Issues: 1 Forks: 5
https://github.com/MasterBin-IIAU/Unicorn
kubewharf/kubezoo
A lightweight Kubernetes multi-tenancy gateway
Language: Go
#kubernetes #multi_tenancy #serverless
Stars: 136 Issues: 1 Forks: 15
https://github.com/kubewharf/kubezoo
jfversluis/learn-dotnet-maui
A repository filled with resources available to you to start learning or deepen your knowledge about .NET MAUI
#cross_platform #dotnet_for_android #dotnet_for_ios #dotnet_maui #maui #multi_platform_app_ui #xamarin #xamarin_forms
Stars: 135 Issues: 0 Forks: 8
https://github.com/jfversluis/learn-dotnet-maui
NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
Language: Python
#image_captioning #language_model #multi_modal_learning #multi_task_learning #vision_and_language #vision_language_model #vqa
Stars: 479 Issues: 6 Forks: 21
https://github.com/NVlabs/prismer
kyegomez/Sophia
Effortless plug-and-play optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.
Language: Python
#artificial_intelligence #chatgpt #deep_learning #multi_modality #neural_network #optimizer
Stars: 229 Issues: 11 Forks: 16
https://github.com/kyegomez/Sophia
netease-youdao/EmotiVoice
EmotiVoice: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python
#ai #deep_learning #emotion #emotivoice #multi_speaker #prompt #python #pytorch #speech #speech_synthesis #style #text_to_speech #tts
Stars: 432 Issues: 3 Forks: 38
https://github.com/netease-youdao/EmotiVoice
ixartz/SaaS-Boilerplate
SaaS Boilerplate built with Next.js + Tailwind CSS + Shadcn UI + TypeScript. Full-stack React application with Auth, Multi-tenancy, Roles & Permissions, i18n, Landing Page, DB, Logging, Testing
Language: TypeScript
#authentication #boilerplate #multi_tenancy #nextjs #react #reactjs #saas #saas_app #saas_application #saas_boilerplate #saas_kit #shadcn_ui #stack #starter #starter_kit #starter_project #starter_template #template #template_project #typescript
Stars: 634 Issues: 0 Forks: 75
https://github.com/ixartz/SaaS-Boilerplate
InternLM/MindSearch
An LLM-based Multi-agent Framework of Web Search Engine similar to Perplexity.ai Pro and SearchGPT
Language: Python
#ai_search_engine #gpt #llm #llms #multi_agent_systems #perplexity_ai #search #searchgpt #transformer #web_search
Stars: 792 Issues: 9 Forks: 60
https://github.com/InternLM/MindSearch
cvg/depthsplat
[CVPR'25] DepthSplat: Connecting Gaussian Splatting and Depth
Language: Python
#feed_forward_gaussian_splatting #monocular_depth #multi_view_stereo #view_synthesis
Stars: 318 Issues: 8 Forks: 9
https://github.com/cvg/depthsplat
HKUDS/VideoRAG
"VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"
Language: Python
#large_language_models #llms #long_video_understanding #multi_modal_llms #rag #retrieval_augmented_generation
Stars: 201 Issues: 1 Forks: 14
https://github.com/HKUDS/VideoRAG
therealoliver/Deepdive-llama3-from-scratch
Implement llama3 inference step by step: grasp the core concepts, master the derivation of the process, and implement the code.
Language: Jupyter Notebook
#attention #attention_mechanism #gpt #inference #kv_cache #language_model #llama #llm_configuration #llms #mask #multi_head_attention #positional_encoding #residuals #rms #rms_norm #rope #rotary_position_encoding #swiglu #tokenizer #transformer
Stars: 388 Issues: 0 Forks: 28
https://github.com/therealoliver/Deepdive-llama3-from-scratch
ibelick/zola
Zola is a free, open-source AI chat app with multi-model support.
Language: TypeScript
#ai #chat #multi_model #nextjs #open_source #prompt_kit #shadcn_ui #supabase #typescript
Stars: 262 Issues: 3 Forks: 41
https://github.com/ibelick/zola
ses4255/Versatile-OCR-Program
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
Language: Python
#doclayout #educational_data #exam_ocr #machine_learning #ml_datasets #multi_modal #ocr #openai #paper_ocr #table_parsing
Stars: 250 Issues: 0 Forks: 11
https://github.com/ses4255/Versatile-OCR-Program
bytedance/deer-flow
DeerFlow is a community-driven framework for deep research, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
Language: TypeScript
#agent #agentic #agentic_framework #agentic_workflow #ai #ai_agents #bytedance #deep_research #langchain #langgraph #langmanus #llm #multi_agent #nodejs #podcast #python #typescript
Stars: 661 Issues: 4 Forks: 59
https://github.com/bytedance/deer-flow
strands-agents/sdk-python
A model-driven approach to building AI agents in just a few lines of code.
Language: Python
#agentic #agentic_ai #agents #ai #anthropic #autonomous_agents #genai #litellm #llm #machine_learning #mcp #multi_agent_systems #ollama #opentelemetry #python
Stars: 217 Issues: 9 Forks: 23
https://github.com/strands-agents/sdk-python
NirDiamant/agents-towards-production
This repository delivers end-to-end, code-first tutorials covering every layer of production-grade GenAI agents, guiding you from spark to scale with proven patterns and reusable blueprints for real-world launches.
Language: Jupyter Notebook
#agent #agent_framework #agents #ai_agents #genai #generative_ai #llm #llms #mlops #multi_agent #production #tool_integration #tutorials
Stars: 1422 Issues: 0 Forks: 141
https://github.com/NirDiamant/agents-towards-production
NVlabs/Long-RL
Long-RL: Scaling RL to Long Sequences
Language: Python
#efficient_ai #large_language_models #long_sequence #multi_modality #reinforcement_learning #sequence_parallelism
Stars: 301 Issues: 2 Forks: 3
https://github.com/NVlabs/Long-RL