Fundus: A Simple-to-Use News Scraper Optimized for High Quality Extractions
🖥https://github.com/flairnlp/fundus
🖥https://github.com/flairnlp/fundus
GitHub
GitHub - flairNLP/fundus: A very simple news crawler with a funny name
A very simple news crawler with a funny name. Contribute to flairNLP/fundus development by creating an account on GitHub.
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
🖥https://github.com/facebookresearch/vggsfm
🖥https://github.com/facebookresearch/vggsfm
GitHub
GitHub - facebookresearch/vggsfm: VGGSfM: Visual Geometry Grounded Deep Structure From Motion
VGGSfM: Visual Geometry Grounded Deep Structure From Motion - facebookresearch/vggsfm
DataComp-LM: In search of the next generation of training sets for language models
🖥https://github.com/mlfoundations/dclm
🖥https://github.com/mlfoundations/dclm
GitHub
GitHub - mlfoundations/dclm: DataComp for Language Models
DataComp for Language Models. Contribute to mlfoundations/dclm development by creating an account on GitHub.
OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
🖥https://github.com/wanghao9610/ov-dino
🖥https://github.com/wanghao9610/ov-dino
GitHub
GitHub - wanghao9610/OV-DINO: Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective…
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion - wanghao9610/OV-DINO
"Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models
🖥https://github.com/verazuo/jailbreak_llms
🖥https://github.com/verazuo/jailbreak_llms
GitHub
GitHub - verazuo/jailbreak_llms: [CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open…
[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts). - verazuo/jailbreak_llms
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
🖥https://github.com/mcgill-nlp/llm2vec
🖥https://github.com/mcgill-nlp/llm2vec
GitHub
GitHub - McGill-NLP/llm2vec: Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders' - McGill-NLP/llm2vec
Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors
🖥https://github.com/shangwei5/st-avsr
🖥https://github.com/shangwei5/st-avsr
GitHub
GitHub - shangwei5/ST-AVSR: Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors (ECCV2024)
Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors (ECCV2024) - shangwei5/ST-AVSR
Neural General Circulation Models for Weather and Climate
🖥https://github.com/google-research/neuralgcm
🖥https://github.com/google-research/neuralgcm
GitHub
GitHub - neuralgcm/neuralgcm: Hybrid ML + physics model of the Earth's atmosphere
Hybrid ML + physics model of the Earth's atmosphere - neuralgcm/neuralgcm
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
🖥https://github.com/thudm/cogvideo
🖥https://github.com/thudm/cogvideo
GitHub
GitHub - THUDM/CogVideo: text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023) - THUDM/CogVideo
Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers
🖥https://github.com/liruiw/HPT
🖥https://github.com/liruiw/HPT
GitHub
GitHub - liruiw/HPT: Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.
Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner. - liruiw/HPT
TensorIR: An Abstraction for Automatic Tensorized Program Optimization
🖥https://github.com/mlc-ai/web-llm
🖥https://github.com/mlc-ai/web-llm
GitHub
GitHub - mlc-ai/web-llm: High-performance In-browser LLM Inference Engine
High-performance In-browser LLM Inference Engine . Contribute to mlc-ai/web-llm development by creating an account on GitHub.
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models
🖥https://github.com/microsoft/vptq
🖥https://github.com/microsoft/vptq
GitHub
GitHub - microsoft/VPTQ: VPTQ, A Flexible and Extreme low-bit quantization algorithm
VPTQ, A Flexible and Extreme low-bit quantization algorithm - microsoft/VPTQ
OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer
🖥https://github.com/om-ai-lab/OmAgent
🖥https://github.com/om-ai-lab/OmAgent
GitHub
GitHub - om-ai-lab/OmAgent: Build multimodal language agents for fast prototype and production
Build multimodal language agents for fast prototype and production - om-ai-lab/OmAgent
WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wild
🖥https://github.com/rolpotamias/WiLoR
🖥https://github.com/rolpotamias/WiLoR
GitHub
GitHub - rolpotamias/WiLoR: WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild
WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild - rolpotamias/WiLoR
Cross-video Identity Correlating for Person Re-identification Pre-training
🖥https://github.com/zplusdragon/cion_reidzoo
🖥https://github.com/zplusdragon/cion_reidzoo
GitHub
GitHub - Zplusdragon/CION_ReIDZoo: [NeurIPS2024] Cross-video Identity Correlating for Person Re-identification Pre-training
[NeurIPS2024] Cross-video Identity Correlating for Person Re-identification Pre-training - Zplusdragon/CION_ReIDZoo
ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models
🖥https://github.com/bytedance/abq-llm
🖥https://github.com/bytedance/abq-llm
GitHub
GitHub - bytedance/ABQ-LLM: An acceleration library that supports arbitrary bit-width combinatorial quantization operations
An acceleration library that supports arbitrary bit-width combinatorial quantization operations - bytedance/ABQ-LLM
A visualization method for data domain changes in CNN networks and the optimization method for selecting thresholds in classification tasks
🖥https://github.com/Faceplugin-ltd/FaceLivenessDetection-Android
🖥https://github.com/Faceplugin-ltd/FaceLivenessDetection-Android
GitHub
GitHub - Faceplugin-ltd/FaceLivenessDetection-Android: Liveness detection SDK Android - iBeta level 2 compliant 3D passive liveness…
Liveness detection SDK Android - iBeta level 2 compliant 3D passive liveness detection engine which can detect printed photos, video replay, 3D masks, and deepfake threats - Faceplugin-ltd/FaceLiv...