GitHub Trends
10.1K subscribers
15.3K links
See what the GitHub community is most excited about today.

A bot automatically fetches new repositories from https://github.com/trending and sends them to the channel.

Author and maintainer: https://github.com/katursis
Download Telegram
#rust #events #forwarder #logs #metrics #observability #parser #pipeline #router #rust #stream_processing #vector

Vector is a powerful tool for managing your observability data, such as logs and metrics. It allows you to collect, transform, and route your data to any vendor you choose, giving you full control. Vector is reliable, fast (up to 10x faster than alternatives), and secure. It helps reduce costs, improve data quality, and consolidate agents, making your observability processes more efficient and reliable. With a strong community support and extensive documentation, Vector is used by many big companies and is downloaded over 100,000 times daily. This makes it a valuable tool for anyone looking to manage their data effectively.

https://github.com/vectordotdev/vector
👍1
#python #ai4science #document_analysis #extract_data #layout_analysis #ocr #parser #pdf #pdf_converter #pdf_extractor_llm #pdf_extractor_pretrain #pdf_extractor_rag #pdf_parser #python

MinerU is a tool that converts PDFs into machine-readable formats like markdown or JSON. Here are the key benefits and features MinerU removes headers, footers, and other unnecessary elements to ensure the text is semantically coherent and in human-readable order, even for complex layouts.
- **Structure Preservation** It extracts images, image descriptions, tables, and table titles.
- **Formula Conversion** Recognizes tables and converts them to LaTeX or HTML format.
- **OCR Support** Supports multiple output formats and various visualization results.
- **GPU and CPU Compatibility**: Works on both CPU and GPU environments, compatible with Windows, Linux, and Mac.

You can try MinerU through an online demo, a quick CPU demo, or by using a GPU for faster processing. For detailed usage, refer to the command line options, API integration, and deployment guides provided.

https://github.com/opendatalab/MinerU
#python #docx #llm #parser #pdf #powerpoint

MegaParse is a powerful tool that helps you parse different types of documents like text, PDFs, PowerPoint presentations, and Word documents without losing any information. It is fast, efficient, and supports many file formats. You can use it for free since it is open source. To use MegaParse, you just need to install it with a simple command and set up some additional tools depending on your needs. This tool benefits you by making it easy to extract data from various documents quickly and accurately, saving you time and effort.

https://github.com/QuivrHQ/MegaParse
#go #command_line #commands #flags #go #golang #parser #struct #tags

Kong is a tool for parsing command-line inputs in Go programs. It helps you create complex command-line structures with minimal effort. Here’s how it benefits you You define your command-line structure using Go types, and Kong maps the command-line arguments to these types automatically.
- **Auto-Generated Help** You can handle commands either by switching on the command string or by attaching a `Run()` method to each command, making your code more robust.
- **Customization Options** It supports validation and hooks that can be used to perform actions before or after parsing the command-line, giving you more control over the process.

Overall, Kong simplifies the process of creating and managing command-line interfaces in Go, making your development process more efficient.

https://github.com/alecthomas/kong
👍1
#rust #code_analyzer #code_style #coding_standards #formatter #lexer #linter #parser #php #static_analysis #type_checker

Mago is a very fast tool for PHP that checks your code for errors, fixes many problems automatically, and formats it neatly. It is built using Rust, which makes it much faster and more reliable than many other PHP tools. Mago helps you find bugs, improve code quality, and keep your code clean and consistent with less effort. It also offers deep analysis to catch tricky issues and lets you see your code’s structure visually. This means you can write better PHP code faster and with more confidence. Installation is easy on macOS and Linux using a simple script or other methods like Homebrew or Composer.

https://github.com/carthage-software/mago