GitHub Trends
10.1K subscribers
15.3K links
See what the GitHub community is most excited about today.

A bot automatically fetches new repositories from https://github.com/trending and sends them to the channel.

Author and maintainer: https://github.com/katursis
Download Telegram
#cplusplus #ai #analytics #big_data #clickhouse #cpp #dbms #distributed_database #hacktoberfest #mpp #olap #rust #sql

ClickHouse is a free, open-source database that helps you get real-time analytical data reports. It's easy to install using a simple command on Linux, macOS, or FreeBSD. You can find lots of helpful resources like tutorials, documentation, and videos on their website. There are also community meetups and online chats where you can learn from other users. Using ClickHouse benefits you by allowing fast and efficient analysis of large amounts of data, which is useful for making quick decisions and improving your business operations.

https://github.com/ClickHouse/ClickHouse
#python #agent #ai #data_visualization #database #llm #rag #sql #text_to_sql

Vanna is a tool that helps you generate SQL queries easily. Here’s how it works: you train a model with your database information, and then you can ask questions to get the corresponding SQL queries. This process is simple and doesn't require you to know the technical details underneath. The benefits include high accuracy, security since your data stays local, and the ability to use it with any SQL database. You can also customize the interface to suit your needs, such as using Jupyter Notebooks, Slack, or web apps. This makes it easier and faster to work with your database without writing complex SQL queries manually.

https://github.com/vanna-ai/vanna
#other #apachespark #awesome #bigdata #data #dataengineering #sql

This handbook is a comprehensive guide to help you become a great data engineer. It provides a roadmap to get started, including hands-on projects, interview tips, and recommended books. You can join various communities and follow newsletters to stay updated. The handbook also lists top companies, blogs, whitepapers, YouTube channels, podcasts, and courses that can help you learn and grow in data engineering. Using these resources, you can gain practical knowledge, network with professionals, and stay informed about the latest trends and technologies in the field. This will help you build a strong foundation and advance your career as a data engineer.

https://github.com/DataExpert-io/data-engineer-handbook
#go #database #go #mariadb #mysql #mysql_driver #sql

The Go-MySQL-Driver is a lightweight and fast MySQL driver for Go, written purely in Go without any C bindings. It supports connections over TCP/IPv4, IPv6, Unix domain sockets, and custom protocols. Here are the key benefits You can install it using `go get -u github.com/go-sql-driver/mysql`.
- **Automatic Connection Handling** You can configure various settings like timeouts, charset, collation, and more through the Data Source Name (DSN).
- **Advanced Features** It automatically uses utf8mb4_general_ci by default and allows setting other charsets and collations.

Using this driver simplifies interacting with MySQL databases in Go, providing a robust and configurable solution.

https://github.com/go-sql-driver/mysql
#java #bigdata #data_encryption #data_pipeline #database #database_cluster #database_gateway #database_middleware #distributed_database #distributed_sql_database #distributed_transaction #encrypt #mysql #postgresql #read_write_splitting #shard #sql

Apache ShardingSphere is a powerful tool that helps manage and scale databases. It allows you to break down large databases into smaller pieces (sharding), handle more data traffic (scaling), and secure your data with encryption. This tool works with any database and provides a unified way for applications to interact with multiple databases as if they were one.

The benefits include Your database can handle more data and users without slowing down.
- **Improved Security** Applications only need to communicate with one standardized service, making it simpler to manage.
- **Flexibility**: You can customize the tool to fit your needs using its pluggable architecture.

Overall, Apache ShardingSphere makes managing and scaling databases much easier and more efficient.

https://github.com/apache/shardingsphere
#python #hacktoberfest #pypi #sql #sql_linter

SQLFluff is a tool that helps you write better SQL code. It checks your SQL for mistakes and can even fix many of them automatically. It supports many different SQL dialects, like MySQL, PostgreSQL, and Snowflake, and works with templates like Jinja and dbt. This means you can focus on writing your queries instead of worrying about formatting and errors. You can install it using pip, use it in VS Code, or even run it online. It makes your SQL code cleaner and more consistent, saving you time and effort.

https://github.com/sqlfluff/sqlfluff
#swift #database #database_observation #grdb #spm #sql #sql_builder #sqlite #sqlite_databases

GRDB is a Swift library that helps you interact with SQLite databases in a safe and efficient way. Here’s the key information GRDB allows you to save your application’s permanent data into SQLite databases.
- **SQL Generation** You can get notifications when database values are modified.
- **Robust Concurrency** GRDB helps you evolve the schema of your database as you ship new versions of your application.

### Installation
- You can install GRDB using the Swift Package Manager, CocoaPods, or Carthage (though Carthage is unsupported).

### Usage
- Open a database connection using `DatabaseQueue` or `DatabasePool`.
- Define your database schema and create tables.
- Use the `FetchableRecord`, `TableRecord`, and `PersistableRecord` protocols to define your record types.
- Fetch records using methods like `fetchCursor`, `fetchAll`, `fetchSet`, and `fetchOne`.
- Perform CRUD operations (Create, Read, Update, Delete) using methods like `insert`, `update`, `save`, and `delete`.

### Query Interface
- GRDB provides a query interface that lets you write pure Swift instead of SQL.
- You can filter, order, limit, and join records using chainable methods.
- It supports associations and joins.

### Records
- Records are your application objects that can initialize themselves from rows.
- You can define record types using protocols like `FetchableRecord`, `TableRecord`, and `PersistableRecord`.
- Records can be compared against other records or previous versions to avoid unnecessary updates.

### Error Handling
- GRDB throws `DatabaseError` and `RecordError` for various issues.
- It also uses fatal errors to indicate programmer errors or misuses.
- You can configure error logging and tracing to monitor database operations.

### Encryption
- GRDB supports encryption using SQLCipher.
- You can create or open an encrypted database by providing a passphrase.

### Backup and Interruption
- You can backup a database into another.
- The `interrupt` method can cause any pending database operation to abort.

### FAQ
- GRDB provides answers to common questions about opening connections, SQL queries, general usage, associations,

https://github.com/groue/GRDB.swift
#c_lang #ai #big_data #c #cloudberry #data_analysis #data_warehouse #database #distributed_database #greenplum #mpp #olap #postgres #postgresql #sql

Apache Cloudberry is a powerful, open-source database designed for large-scale data processing and analytics. It is built by the creators of Greenplum Database and uses a newer PostgreSQL kernel, making it suitable for data warehouses and AI/ML workloads. You can easily try it out using a Docker-based sandbox or build it from source on Linux or macOS. The community is active, with many channels for support, discussions, and contributions. This means you can get help quickly, share ideas, and even contribute to the project yourself. It's licensed under the Apache License, Version 2.0, making it free to use and modify. Overall, Apache Cloudberry offers advanced database capabilities and a supportive community, which can greatly benefit users needing robust data management solutions.

https://github.com/apache/cloudberry
#go #api_gateway #application_security #appsec #blueteam #bruteforce #captcha #cve #cybersecurity #firewall #hackers #http_flood #security #self_hosted #sql_injection #vulnerability #waf #web_application_firewall #web_security #websecurity #xss

SafeLine is a tool that helps keep your web applications safe from attacks. It acts like a shield between your website and the internet, filtering and blocking bad traffic. This protects your site from things like SQL injection, XSS, and other types of attacks. SafeLine also has features like rate limiting to stop too much traffic, anti-bot challenges to block bots, and dynamic protection to encrypt your HTML and JS codes. Using SafeLine makes your web apps more secure, which is important to protect your data and users.

https://github.com/chaitin/SafeLine
#haskell #api #automatic_api #database #haskell #http #pg #pgsql #postgres #postgresql #postgrest #rest #server #sql

PostgREST is a tool that turns your PostgreSQL database into a RESTful API quickly and efficiently. It provides a fast, standards-compliant API without you having to write it from scratch. Here are the key benefits It offers subsecond response times and can handle up to 2000 requests per second, even on a free Heroku tier.
- **Security** It generates self-documenting APIs using OpenAPI standards and ensures data integrity by enforcing database constraints.
- **Performance**: Written in Haskell, it uses efficient database connections and lightweight threads for high performance.

Overall, PostgREST makes creating and managing APIs easier, faster, and more secure.

https://github.com/PostgREST/postgrest