ucbepic/docetl
A system for complex LLM-powered document processing
Language:Python
Total stars: 482
Stars trend:
#python
#data, #datapipelines, #elt, #etl, #llm, #python, #workflow
A system for complex LLM-powered document processing
Language:Python
Total stars: 482
Stars trend:
28 Sep 2024
7am ▌ +4
8am █▏ +9
9am ▋ +5
10am ▎ +2
11am █▎ +10
12pm ▌ +4
1pm █▏ +9
2pm ▊ +6
3pm █▎ +10
4pm █▏ +9
5pm ▌ +4
6pm ▍ +3
#python
#data, #datapipelines, #elt, #etl, #llm, #python, #workflow
Litlyx/litlyx
All-in-one Analytics Solution. Setup in 30 seconds. Display all your data on an AI-powered dashboard. Fully self-hostable and GDPR compliant.
Language:Vue
Total stars: 700
Stars trend:
#vue
#ai, #analytics, #angular, #charts, #data, #dataanalysis, #datavisualization, #javascript, #metrics, #nextjs, #nodejs, #nuxt, #opensource, #react, #statistics, #typescript, #vue, #website
All-in-one Analytics Solution. Setup in 30 seconds. Display all your data on an AI-powered dashboard. Fully self-hostable and GDPR compliant.
Language:Vue
Total stars: 700
Stars trend:
3 Nov 2024
9am ▉ +7
10am ██ +16
11am ██▎ +18
12pm █▉ +15
1pm ▋ +5
2pm █▉ +15
#vue
#ai, #analytics, #angular, #charts, #data, #dataanalysis, #datavisualization, #javascript, #metrics, #nextjs, #nodejs, #nuxt, #opensource, #react, #statistics, #typescript, #vue, #website
shayonj/pg_flo
Stream, transform, and route PostgreSQL data in real-time.
Language:Go
Total stars: 127
Stars trend:
#go
#data, #database, #etl, #go, #golang, #logicalreplication, #postgres, #postgresql, #stream
Stream, transform, and route PostgreSQL data in real-time.
Language:Go
Total stars: 127
Stars trend:
3 Nov 2024
5pm ▍ +3
6pm ▎ +2
7pm ██▉ +23
8pm ██▌ +20
9pm █▋ +13
10pm █▉ +15
#go
#data, #database, #etl, #go, #golang, #logicalreplication, #postgres, #postgresql, #stream
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Language:Python
Total stars: 16541
Stars trend:
#python
#automation, #data, #dataengineering, #dataops, #datascience, #infrastructure, #mlops, #observability, #orchestration, #pipeline, #prefect, #python, #workflow, #workflowengine
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Language:Python
Total stars: 16541
Stars trend:
9 Nov 2024
4am ▎ +2
5am ▏ +1
6am ▍ +3
7am ▏ +1
8am ▎ +2
9am ▏ +1
10am ▍ +3
11am ██▉ +23
12pm ███ +24
1pm ████▍ +35
#python
#automation, #data, #dataengineering, #dataops, #datascience, #infrastructure, #mlops, #observability, #orchestration, #pipeline, #prefect, #python, #workflow, #workflowengine
panel-extensions/panel-graphic-walker
A project providing a Graphic Walker Pane for use with HoloViz Panel.
Language:Python
Total stars: 124
Stars trend:
#python
#businessintelligence, #data, #dataanalysis, #dataapp, #dataexploration, #datamining, #datavisualization, #eda, #holovizpanel, #lowcode, #notebook, #pivottable, #python, #tableau, #tableaualternative, #vega, #vegalite, #visualization
A project providing a Graphic Walker Pane for use with HoloViz Panel.
Language:Python
Total stars: 124
Stars trend:
30 Dec 2024
2pm ▋ +5
3pm █▉ +15
4pm ██▊ +22
5pm ▋ +5
6pm █▉ +15
7pm █▋ +13
#python
#businessintelligence, #data, #dataanalysis, #dataapp, #dataexploration, #datamining, #datavisualization, #eda, #holovizpanel, #lowcode, #notebook, #pivottable, #python, #tableau, #tableaualternative, #vega, #vegalite, #visualization
pyper-dev/pyper
Concurrent Python made simple
Language:Python
Total stars: 136
Stars trend:
#python
#asyncio, #concurrency, #data, #datacollection, #dataengineering, #datapipelines, #dataprocessing, #multiprocessing, #parallelcomputing, #python, #threading
Concurrent Python made simple
Language:Python
Total stars: 136
Stars trend:
15 Jan 2025
2am █▌ +12
3am ██▊ +22
4am ██▊ +22
5am ██▋ +21
6am ██▏ +17
#python
#asyncio, #concurrency, #data, #datacollection, #dataengineering, #datapipelines, #dataprocessing, #multiprocessing, #parallelcomputing, #python, #threading
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Language:TypeScript
Total stars: 21906
Stars trend:
#typescript
#ai, #aiscraping, #crawler, #data, #htmltomarkdown, #llm, #markdown, #rag, #scraper, #scraping, #webcrawler, #webscraping
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Language:TypeScript
Total stars: 21906
Stars trend:
20 Jan 2025
5pm ▏ +1
6pm ▎ +2
7pm ▉ +7
8pm ▊ +6
9pm ▉ +7
10pm █ +8
11pm ▋ +5
21 Jan 2025
12am ▉ +7
1am ▌ +4
2am ▍ +3
3am ██ +16
4am █▍ +11
#typescript
#ai, #aiscraping, #crawler, #data, #htmltomarkdown, #llm, #markdown, #rag, #scraper, #scraping, #webcrawler, #webscraping
DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
Language:Jupyter Notebook
Total stars: 24911
Stars trend:
#jupyternotebook
#apachespark, #awesome, #bigdata, #data, #dataengineering, #sql
This is a repo with links to everything you'd ever want to learn about data engineering
Language:Jupyter Notebook
Total stars: 24911
Stars trend:
22 Jan 2025
9am ▏ +1
10am ▊ +6
11am ▉ +7
12pm ▊ +6
1pm █▏ +9
2pm █▍ +11
3pm █ +8
4pm █▏ +9
5pm █ +8
6pm ▍ +3
7pm █ +8
#jupyternotebook
#apachespark, #awesome, #bigdata, #data, #dataengineering, #sql
D4Vinci/Scrapling
🕷️ Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python
Language:Python
Total stars: 2301
Stars trend:
#python
#ai, #aiscraping, #automation, #crawler, #crawling, #crawlingpython, #data, #dataextraction, #hacktoberfest, #playwright, #python, #python3, #scraping, #selectors, #stealth, #webscraper, #webscraping, #webscrapingpython, #webscraping, #xpath
🕷️ Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python
Language:Python
Total stars: 2301
Stars trend:
12 Feb 2025
6am ▍ +3
7am ▎ +2
8am ▋ +5
9am ▍ +3
10am ▊ +6
11am ▉ +7
12pm █▉ +15
1pm █▌ +12
2pm █▉ +15
3pm █▌ +12
4pm ▊ +6
5pm █▋ +13
#python
#ai, #aiscraping, #automation, #crawler, #crawling, #crawlingpython, #data, #dataextraction, #hacktoberfest, #playwright, #python, #python3, #scraping, #selectors, #stealth, #webscraper, #webscraping, #webscrapingpython, #webscraping, #xpath
sinaptik-ai/pandas-ai
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
Language:Python
Total stars: 17113
Stars trend:
#python
#ai, #csv, #data, #dataanalysis, #datascience, #datavisualization, #database, #datalake, #gpt4, #llm, #pandas, #sql, #texttosql
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
Language:Python
Total stars: 17113
Stars trend:
2 Mar 2025
5am ██ +16
6am █▍ +11
7am █▏ +9
8am ▉ +7
9am █ +8
10am ▉ +7
11am ▊ +6
12pm █ +8
1pm █ +8
2pm █▌ +12
3pm ▍ +3
4pm █▍ +11
#python
#ai, #csv, #data, #dataanalysis, #datascience, #datavisualization, #database, #datalake, #gpt4, #llm, #pandas, #sql, #texttosql
cocoindex-io/cocoindex
ETL framework to turn your data AI-ready - with realtime incremental updates and support custom logic like lego.
Language:Rust
Total stars: 672
Stars trend:
#rust
#ai, #changedatacapture, #data, #dataengineering, #dataindexing, #datainfrastructure, #dataprocessing, #dataflow, #etl, #helpwanted, #indexing, #knowledgegraph, #llm, #pipeline, #python, #rag, #realtime, #rust, #semanticsearch, #streaming
ETL framework to turn your data AI-ready - with realtime incremental updates and support custom logic like lego.
Language:Rust
Total stars: 672
Stars trend:
20 Apr 2025
3pm █▉ +15
4pm ▍ +3
5pm ▋ +5
6pm ▉ +7
7pm ▋ +5
8pm ▋ +5
9pm ▊ +6
10pm ▊ +6
11pm █ +8
21 Apr 2025
12am ▍ +3
1am ▉ +7
2am ▉ +7
#rust
#ai, #changedatacapture, #data, #dataengineering, #dataindexing, #datainfrastructure, #dataprocessing, #dataflow, #etl, #helpwanted, #indexing, #knowledgegraph, #llm, #pipeline, #python, #rag, #realtime, #rust, #semanticsearch, #streaming
spiceai/spiceai
A portable accelerated data query and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.
Language:Rust
Total stars: 2249
Stars trend:
#rust
#artificialintelligence, #data, #developers, #infrastructure, #machinelearning, #sql, #timeseries
A portable accelerated data query and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.
Language:Rust
Total stars: 2249
Stars trend:
22 Apr 2025
12pm ▏ +1
1pm █ +8
2pm ▉ +7
3pm █▌ +12
4pm █▊ +14
5pm █▎ +10
6pm ▋ +5
7pm ▊ +6
8pm ▊ +6
9pm █▋ +13
#rust
#artificialintelligence, #data, #developers, #infrastructure, #machinelearning, #sql, #timeseries
meta-llama/synthetic-data-kit
Tool for generating high quality Synthetic datasets
Language:Python
Total stars: 104
Stars trend:
#python
#data, #generation, #llm, #python, #synthetic
Tool for generating high quality Synthetic datasets
Language:Python
Total stars: 104
Stars trend:
29 Apr 2025
4pm █▋ +13
5pm █▋ +13
6pm ▍ +3
7pm ▊ +6
8pm ▋ +5
9pm ██▎ +18
10pm █ +8
11pm ▋ +5
30 Apr 2025
12am ▋ +5
1am ▎ +2
2am ▉ +7
3am ▊ +6
#python
#data, #generation, #llm, #python, #synthetic
cocoindex-io/cocoindex
Real-time data transformation framework for AI. Ultra performant, with incremental processing.
Language:Rust
Total stars: 1459
Stars trend:
#rust
#ai, #changedatacapture, #data, #dataengineering, #dataindexing, #datainfrastructure, #dataprocessing, #dataflow, #etl, #helpwanted, #indexing, #knowledgegraph, #llm, #pipeline, #python, #rag, #realtime, #rust, #semanticsearch, #streaming
Real-time data transformation framework for AI. Ultra performant, with incremental processing.
Language:Rust
Total stars: 1459
Stars trend:
20 May 2025
1am ▊ +6
2am ▍ +3
3am ▊ +6
4am █▎ +10
5am █▎ +10
6am ▋ +5
7am █▎ +10
8am ▎ +2
9am ▌ +4
10am █▎ +10
11am ▉ +7
12pm █ +8
#rust
#ai, #changedatacapture, #data, #dataengineering, #dataindexing, #datainfrastructure, #dataprocessing, #dataflow, #etl, #helpwanted, #indexing, #knowledgegraph, #llm, #pipeline, #python, #rag, #realtime, #rust, #semanticsearch, #streaming
speedyapply/2025-AI-College-Jobs
2025 AI/ML internship & new graduate job list updated daily
Language:
Total stars: 1330
Stars trend:
#ai, #applications, #artificialintelligence, #college, #data, #datascience, #fulltime, #hiring, #intern, #internship, #internships, #interviews, #jobbboard, #jobs, #junior, #machinelearning, #ml, #newgrad, #quant, #university
2025 AI/ML internship & new graduate job list updated daily
Language:
Total stars: 1330
Stars trend:
31 May 2025
11pm ▏ +1
1 Jun 2025
12am +0
1am ▋ +5
2am ▌ +4
3am ▉ +7
4am █▌ +12
5am █▌ +12
6am █▉ +15
7am ▋ +5
8am ▌ +4
9am ▉ +7
10am ▉ +7
#ai, #applications, #artificialintelligence, #college, #data, #datascience, #fulltime, #hiring, #intern, #internship, #internships, #interviews, #jobbboard, #jobs, #junior, #machinelearning, #ml, #newgrad, #quant, #university
DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
Language:Jupyter Notebook
Total stars: 28322
Stars trend:
#jupyternotebook
#apachespark, #awesome, #bigdata, #data, #dataengineering, #sql
This is a repo with links to everything you'd ever want to learn about data engineering
Language:Jupyter Notebook
Total stars: 28322
Stars trend:
1 Jun 2025
6pm ▏ +1
7pm ▏ +1
8pm ▏ +1
9pm +0
10pm +0
11pm ▊ +6
2 Jun 2025
12am ████ +32
1am █▋ +13
2am █▉ +15
3am █▋ +13
#jupyternotebook
#apachespark, #awesome, #bigdata, #data, #dataengineering, #sql