ifzhang/FairMOT
A simple baseline for one-shot multi-object tracking
Language: Python
#joint_detection_and_tracking #multi_object_tracking #one_shot_tracker #real_time
Stars: 152 Issues: 0 Forks: 20
https://github.com/ifzhang/FairMOT
A simple baseline for one-shot multi-object tracking
Language: Python
#joint_detection_and_tracking #multi_object_tracking #one_shot_tracker #real_time
Stars: 152 Issues: 0 Forks: 20
https://github.com/ifzhang/FairMOT
GitHub
GitHub - ifzhang/FairMOT: [IJCV-2021] FairMOT: On the Fairness of Detection and Re-Identification in Multi-Object Tracking
[IJCV-2021] FairMOT: On the Fairness of Detection and Re-Identification in Multi-Object Tracking - ifzhang/FairMOT
lucidrains/deep-daze
Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network)
Language: Python
#artificial_intelligence #deep_learning #implicit_neural_representation #multi_modality #siren #text_to_image #transformers
Stars: 127 Issues: 5 Forks: 11
https://github.com/lucidrains/deep-daze
Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network)
Language: Python
#artificial_intelligence #deep_learning #implicit_neural_representation #multi_modality #siren #text_to_image #transformers
Stars: 127 Issues: 5 Forks: 11
https://github.com/lucidrains/deep-daze
GitHub
GitHub - lucidrains/deep-daze: Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural…
Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadno...
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Language: Python
#chinese #computer_vision #multi_modal_learning #nlp #pytorch #vision_and_language_pre_training
Stars: 80 Issues: 0 Forks: 7
https://github.com/OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Language: Python
#chinese #computer_vision #multi_modal_learning #nlp #pytorch #vision_and_language_pre_training
Stars: 80 Issues: 0 Forks: 7
https://github.com/OFA-Sys/Chinese-CLIP
GitHub
GitHub - OFA-Sys/Chinese-CLIP: Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation. - OFA-Sys/Chinese-CLIP
MasterBin-IIAU/Unicorn
[ECCV'22 Oral] Towards Grand Unification of Object Tracking
Language: Python
#multi_object_tracking_segmentation #multiple_object_tracking #object_tracking #single_object_tracking #video_object_segmentation
Stars: 132 Issues: 1 Forks: 5
https://github.com/MasterBin-IIAU/Unicorn
[ECCV'22 Oral] Towards Grand Unification of Object Tracking
Language: Python
#multi_object_tracking_segmentation #multiple_object_tracking #object_tracking #single_object_tracking #video_object_segmentation
Stars: 132 Issues: 1 Forks: 5
https://github.com/MasterBin-IIAU/Unicorn
GitHub
GitHub - MasterBin-IIAU/Unicorn: [ECCV'22 Oral] Towards Grand Unification of Object Tracking
[ECCV'22 Oral] Towards Grand Unification of Object Tracking - MasterBin-IIAU/Unicorn
kubewharf/kubezoo
a lightweight kubernetes multi-tenancy gateway
Language: Go
#kubernetes #multi_tenancy #serverless
Stars: 136 Issues: 1 Forks: 15
https://github.com/kubewharf/kubezoo
a lightweight kubernetes multi-tenancy gateway
Language: Go
#kubernetes #multi_tenancy #serverless
Stars: 136 Issues: 1 Forks: 15
https://github.com/kubewharf/kubezoo
GitHub
GitHub - kubewharf/kubezoo: a lightweight kubernetes multi-tenancy gateway
a lightweight kubernetes multi-tenancy gateway. Contribute to kubewharf/kubezoo development by creating an account on GitHub.
jfversluis/learn-dotnet-maui
A repository filled with resources available to you to start learning or deepen your knowledge about .NET MAUI
#cross_platform #dotnet_for_android #dotnet_for_ios #dotnet_maui #maui #multi_platform_app_ui #xamarin #xamarin_forms
Stars: 135 Issues: 0 Forks: 8
https://github.com/jfversluis/learn-dotnet-maui
A repository filled with resources available to you to start learning or deepen your knowledge about .NET MAUI
#cross_platform #dotnet_for_android #dotnet_for_ios #dotnet_maui #maui #multi_platform_app_ui #xamarin #xamarin_forms
Stars: 135 Issues: 0 Forks: 8
https://github.com/jfversluis/learn-dotnet-maui
GitHub
GitHub - jfversluis/learn-dotnet-maui: A repository filled with resources available to you to start learning or deepen your knowledge…
A repository filled with resources available to you to start learning or deepen your knowledge about .NET MAUI - jfversluis/learn-dotnet-maui
NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
Language: Python
#image_captioning #language_model #multi_modal_learning #multi_task_learning #vision_and_language #vision_language_model #vqa
Stars: 479 Issues: 6 Forks: 21
https://github.com/NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
Language: Python
#image_captioning #language_model #multi_modal_learning #multi_task_learning #vision_and_language #vision_language_model #vqa
Stars: 479 Issues: 6 Forks: 21
https://github.com/NVlabs/prismer
GitHub
GitHub - NVlabs/prismer: The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts". - NVlabs/prismer
kyegomez/Sophia
Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.
Language: Python
#artificial_intelligence #chatgpt #deep_learning #multi_modality #neural_network #optimizer
Stars: 229 Issues: 11 Forks: 16
https://github.com/kyegomez/Sophia
Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.
Language: Python
#artificial_intelligence #chatgpt #deep_learning #multi_modality #neural_network #optimizer
Stars: 229 Issues: 11 Forks: 16
https://github.com/kyegomez/Sophia
GitHub
GitHub - kyegomez/Sophia: Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster…
Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs. - kyegomez/Sophia
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python
#ai #deep_learning #emotion #emotivoice #multi_speaker #prompt #python #pytorch #speech #speech_synthesis #style #text_to_speech #tts
Stars: 432 Issues: 3 Forks: 38
https://github.com/netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python
#ai #deep_learning #emotion #emotivoice #multi_speaker #prompt #python #pytorch #speech #speech_synthesis #style #text_to_speech #tts
Stars: 432 Issues: 3 Forks: 38
https://github.com/netease-youdao/EmotiVoice
GitHub
GitHub - netease-youdao/EmotiVoice: EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine - netease-youdao/EmotiVoice
ixartz/SaaS-Boilerplate
🚀🎉📚 SaaS Boilerplate built with Next.js + Tailwind CSS + Shadcn UI + TypeScript. ⚡️ Full-stack React application with Auth, Multi-tenancy, Roles & Permissions, i18n, Landing Page, DB, Logging, Testing
Language: TypeScript
#authentication #boilerplate #multi_tenancy #nextjs #react #reactjs #saas #saas_app #saas_application #saas_boilerplate #saas_kit #shadcn_ui #stack #starter #starter_kit #starter_project #starter_template #template #template_project #typescript
Stars: 634 Issues: 0 Forks: 75
https://github.com/ixartz/SaaS-Boilerplate
🚀🎉📚 SaaS Boilerplate built with Next.js + Tailwind CSS + Shadcn UI + TypeScript. ⚡️ Full-stack React application with Auth, Multi-tenancy, Roles & Permissions, i18n, Landing Page, DB, Logging, Testing
Language: TypeScript
#authentication #boilerplate #multi_tenancy #nextjs #react #reactjs #saas #saas_app #saas_application #saas_boilerplate #saas_kit #shadcn_ui #stack #starter #starter_kit #starter_project #starter_template #template #template_project #typescript
Stars: 634 Issues: 0 Forks: 75
https://github.com/ixartz/SaaS-Boilerplate
GitHub
GitHub - ixartz/SaaS-Boilerplate: 🚀🎉📚 SaaS Boilerplate built with Next.js + Tailwind CSS + Shadcn UI + TypeScript. ⚡️ Full-stack…
🚀🎉📚 SaaS Boilerplate built with Next.js + Tailwind CSS + Shadcn UI + TypeScript. ⚡️ Full-stack React application with Auth, Multi-tenancy, Roles & Permissions, i18n, Landing Page, DB, Loggi...
InternLM/MindSearch
🔍 a LLM-based Multi-agent Framework of Web Search Engine similar to Perplexity.ai Pro and SearchGPT
Language: Python
#ai_search_engine #gpt #llm #llms #multi_agent_systems #perplexity_ai #search #searchgpt #transformer #web_search
Stars: 792 Issues: 9 Forks: 60
https://github.com/InternLM/MindSearch
🔍 a LLM-based Multi-agent Framework of Web Search Engine similar to Perplexity.ai Pro and SearchGPT
Language: Python
#ai_search_engine #gpt #llm #llms #multi_agent_systems #perplexity_ai #search #searchgpt #transformer #web_search
Stars: 792 Issues: 9 Forks: 60
https://github.com/InternLM/MindSearch
GitHub
GitHub - InternLM/MindSearch: 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) - InternLM/MindSearch
cvg/depthsplat
DepthSplat: Connecting Gaussian Splatting and Depth
Language: Python
#feed_forward_gaussian_splatting #monocular_depth #multi_view_stereo #view_synthesis
Stars: 318 Issues: 8 Forks: 9
https://github.com/cvg/depthsplat
DepthSplat: Connecting Gaussian Splatting and Depth
Language: Python
#feed_forward_gaussian_splatting #monocular_depth #multi_view_stereo #view_synthesis
Stars: 318 Issues: 8 Forks: 9
https://github.com/cvg/depthsplat
GitHub
GitHub - cvg/depthsplat: [CVPR'25] DepthSplat: Connecting Gaussian Splatting and Depth
[CVPR'25] DepthSplat: Connecting Gaussian Splatting and Depth - cvg/depthsplat
HKUDS/VideoRAG
"VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"
Language: Python
#large_language_models #llms #long_video_understanding #multi_modal_llms #rag #retrieval_augmented_generation
Stars: 201 Issues: 1 Forks: 14
https://github.com/HKUDS/VideoRAG
"VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"
Language: Python
#large_language_models #llms #long_video_understanding #multi_modal_llms #rag #retrieval_augmented_generation
Stars: 201 Issues: 1 Forks: 14
https://github.com/HKUDS/VideoRAG
GitHub
GitHub - HKUDS/VideoRAG: "VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"
"VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos" - HKUDS/VideoRAG
therealoliver/Deepdive-llama3-from-scratch
Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.
Language: Jupyter Notebook
#attention #attention_mechanism #gpt #inference #kv_cache #language_model #llama #llm_configuration #llms #mask #multi_head_attention #positional_encoding #residuals #rms #rms_norm #rope #rotary_position_encoding #swiglu #tokenizer #transformer
Stars: 388 Issues: 0 Forks: 28
https://github.com/therealoliver/Deepdive-llama3-from-scratch
Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.
Language: Jupyter Notebook
#attention #attention_mechanism #gpt #inference #kv_cache #language_model #llama #llm_configuration #llms #mask #multi_head_attention #positional_encoding #residuals #rms #rms_norm #rope #rotary_position_encoding #swiglu #tokenizer #transformer
Stars: 388 Issues: 0 Forks: 28
https://github.com/therealoliver/Deepdive-llama3-from-scratch
GitHub
GitHub - therealoliver/Deepdive-llama3-from-scratch: Achieve the llama3 inference step-by-step, grasp the core concepts, master…
Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code. - therealoliver/Deepdive-llama3-from-scratch
ibelick/zola
Zola is a free, open-source AI chat app with multi-model support.
Language: TypeScript
#ai #chat #multi_model #nextjs #open_source #prompt_kit #shadcn_ui #supabase #typescript
Stars: 262 Issues: 3 Forks: 41
https://github.com/ibelick/zola
Zola is a free, open-source AI chat app with multi-model support.
Language: TypeScript
#ai #chat #multi_model #nextjs #open_source #prompt_kit #shadcn_ui #supabase #typescript
Stars: 262 Issues: 3 Forks: 41
https://github.com/ibelick/zola
GitHub
GitHub - ibelick/zola: Open chat interface for all your models.
Open chat interface for all your models. Contribute to ibelick/zola development by creating an account on GitHub.
ses4255/Versatile-OCR-Program
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
Language: Python
#doclayout #educational_data #exam_ocr #machine_learning #ml_datasets #multi_modal #ocr #openai #paper_ocr #table_parsing
Stars: 250 Issues: 0 Forks: 11
https://github.com/ses4255/Versatile-OCR-Program
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
Language: Python
#doclayout #educational_data #exam_ocr #machine_learning #ml_datasets #multi_modal #ocr #openai #paper_ocr #table_parsing
Stars: 250 Issues: 0 Forks: 11
https://github.com/ses4255/Versatile-OCR-Program
GitHub
GitHub - ses4255/Versatile-OCR-Program: Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams) - ses4255/Versatile-OCR-Program
bytedance/deer-flow
DeerFlow is a community-driven framework for deep research, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
Language: TypeScript
#agent #agentic #agentic_framework #agentic_workflow #ai #ai_agents #bytedance #deep_research #langchain #langgraph #langmanus #llm #multi_agent #nodejs #podcast #python #typescript
Stars: 661 Issues: 4 Forks: 59
https://github.com/bytedance/deer-flow
DeerFlow is a community-driven framework for deep research, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
Language: TypeScript
#agent #agentic #agentic_framework #agentic_workflow #ai #ai_agents #bytedance #deep_research #langchain #langgraph #langmanus #llm #multi_agent #nodejs #podcast #python #typescript
Stars: 661 Issues: 4 Forks: 59
https://github.com/bytedance/deer-flow
GitHub
GitHub - bytedance/deer-flow: DeerFlow is a community-driven Deep Research framework, combining language models with tools like…
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community. -...
strands-agents/sdk-python
A model-driven approach to building AI agents in just a few lines of code.
Language: Python
#agentic #agentic_ai #agents #ai #anthropic #autonomous_agents #genai #litellm #llm #machine_learning #mcp #multi_agent_systems #ollama #opentelemetry #python
Stars: 217 Issues: 9 Forks: 23
https://github.com/strands-agents/sdk-python
A model-driven approach to building AI agents in just a few lines of code.
Language: Python
#agentic #agentic_ai #agents #ai #anthropic #autonomous_agents #genai #litellm #llm #machine_learning #mcp #multi_agent_systems #ollama #opentelemetry #python
Stars: 217 Issues: 9 Forks: 23
https://github.com/strands-agents/sdk-python
GitHub
GitHub - strands-agents/sdk-python: A model-driven approach to building AI agents in just a few lines of code.
A model-driven approach to building AI agents in just a few lines of code. - strands-agents/sdk-python