Open-Source Benchmark for LLM OCR and Data Extraction

2025-04-01

Omni, an open-source benchmarking tool, compares the OCR and data extraction capabilities of various large multimodal models like gpt-4o, evaluating both text and JSON extraction accuracy. This benchmark provides a comprehensive evaluation of OCR accuracy across traditional OCR providers and LLMs. The dataset and methodologies are open-source, encouraging contributions and expansion. The benchmark focuses on JSON extraction, measuring the accuracy of the entire pipeline: Document ⇒ OCR ⇒ Extraction. It uses a modified json-diff for JSON accuracy and Levenshtein distance for text similarity. The tool supports various models, including OpenAI, Google Gemini, and Anthropic, with a simple command-line interface and JSON output.
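
To illustrate the text-similarity side of such a benchmark, here is a minimal sketch of a Levenshtein-based score. The function names and the normalization by the longer string's length are assumptions made for illustration; the benchmark's own implementation may normalize differently.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b (insertions, deletions, substitutions)."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute (or match)
            ))
        previous = current
    return previous[-1]


def text_similarity(expected: str, actual: str) -> float:
    """Hypothetical 0..1 score: 1.0 means the OCR text matches exactly."""
    longest = max(len(expected), len(actual))
    if longest == 0:
        return 1.0  # two empty strings are identical
    return 1.0 - levenshtein(expected, actual) / longest
```

For example, `text_similarity("Invoice 1042", "Invoice 1O42")` has one substituted character out of twelve and scores roughly 0.92.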

Development

TopoSort: A Highly Efficient Zig Library for Topological Sorting

2025-04-01

TopoSort is a highly efficient Zig library for performing topological sorting on dependency graphs. Its features include dependency graph building, topological sorting, generation of dependency-free subsets for parallel processing, cycle detection and reporting, and support for various node types. A command-line tool, `toposort-cli`, simplifies usage. Setting the `max_range` parameter can speed up dependency tree building by 3-4 times. TopoSort is MIT licensed.
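
The idea of dependency-free subsets can be sketched with Kahn's algorithm, grouping nodes into levels whose members do not depend on each other. This is a Python illustration of the general technique, not TopoSort's Zig API; the function name `topo_levels` and the input shape are hypothetical.

```python
from collections import defaultdict


def topo_levels(dependencies: dict[str, list[str]]) -> list[list[str]]:
    """Group nodes into levels; each level depends only on earlier levels.

    `dependencies` maps a node to the nodes it depends on.
    Raises ValueError if the graph contains a cycle.
    """
    nodes = set(dependencies)
    for deps in dependencies.values():
        nodes.update(deps)

    remaining = {node: 0 for node in nodes}  # unmet dependency counts
    dependents = defaultdict(list)           # dependency -> nodes waiting on it
    for node, deps in dependencies.items():
        for dep in set(deps):                # ignore duplicate edges
            remaining[node] += 1
            dependents[dep].append(node)

    level = sorted(n for n, count in remaining.items() if count == 0)
    levels = []
    placed = 0
    while level:
        levels.append(level)
        placed += len(level)
        next_level = []
        for done in level:
            for node in dependents[done]:
                remaining[node] -= 1
                if remaining[node] == 0:
                    next_level.append(node)
        level = sorted(next_level)

    if placed != len(nodes):
        # Nodes that never reached zero are on a cycle or downstream of one.
        stuck = sorted(n for n, count in remaining.items() if count > 0)
        raise ValueError(f"cycle detected; unresolved nodes: {stuck}")
    return levels
```

Calling `topo_levels({"app": ["lib", "log"], "lib": ["core"], "log": ["core"]})` returns `[["core"], ["lib", "log"], ["app"]]`, so `lib` and `log` could be processed in parallel.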

Development Topological Sort

RubyUI: Blazing Fast Rails UI Component Library

2025-04-01

RubyUI (formerly PhlexUI) is a Rails UI component library built on Phlex, boasting speeds up to 12x faster than traditional ERB templates. It offers a collection of reusable, customizable components that you can copy and paste directly into your applications. RubyUI emphasizes clean design, well-organized components, and extensive customization options, with comprehensive documentation and examples. Its custom-built Stimulus.js controllers keep dependencies to a minimum, making it easy to build efficient and attractive Rails application interfaces.

Development UI component library

Control Your Android Phone with AI: Code Your Commands

2025-04-01

The open-source project `mobile-use` lets you control your Android phone using AI. Simply write commands like "Open Instagram and send a message," and the project executes them using your locally installed Android SDK tools (adb). It supports custom LLM models and offers both command-line and file input methods, allowing you to easily send instructions from the terminal or a file. The project is licensed under the MIT License.

Development

A 149-Byte Minimal PubSub Library

2025-04-01

This article introduces a minimalist PubSub library weighing in at a mere 149 bytes, with virtually no dependencies. It's compared to competitors like nano-pubsub (194 bytes) and tiny-pubsub (401 bytes), highlighting its incredibly small size. Built using EventTarget, the code is clear and concise, and includes TypeScript type definitions. The author encourages contributions to further reduce the library's size.

Development lightweight library

KOReader: A Powerful Cross-Platform E-reader

2025-04-01

KOReader is a document viewer primarily designed for e-ink readers, supporting numerous formats (PDF, EPUB, MOBI, etc.) and devices (Kindle, Kobo, etc.). It boasts a highly customizable reading view, multilingual support, integration with tools like Calibre, and e-ink optimizations for fast page turns. Extensible via plugins, KOReader is developed and maintained by a global community of volunteers.

Development e-reader

Supercharge Your Airflow Pipelines with LLMs: The Apache Airflow AI SDK

2025-03-31

This Apache Airflow AI SDK, built on Pydantic AI, lets you seamlessly integrate large language models (LLMs) into your Airflow workflows. Using decorator-based tasks (@task.llm, @task.llm_branch, @task.agent), it simplifies LLM calls and agent orchestration. Support for various models (OpenAI, Anthropic, etc.), automatic output parsing, and DAG branching capabilities make your data pipelines smarter and more efficient.

Development

Gumloop's guMCP: An Open-Source Unified Model Context Protocol Server Collection

2025-03-31

Gumloop has released guMCP, an open-source collection of Model Context Protocol (MCP) servers that run locally and remotely. Aiming to create the largest unified MCP server collection, it fosters a community around AI integrations and the future of AGI. guMCP supports both stdio and SSE transports and includes servers for file systems, databases, development tools, web automation, and more, with community contributions encouraged. The project is licensed under GPL-3.0 and emphasizes security, citing compliance with SOC 2 Type II, GDPR, and HIPAA.

Development

Control WhatsApp with Claude: A WhatsApp MCP Server

2025-03-31

This project integrates WhatsApp with the Claude large language model (LLM). A Go bridge connects to the WhatsApp Web multi-device API and stores messages locally in a SQLite database. A Python server implements the Model Context Protocol (MCP), exposing a set of tools that let Claude search messages and contacts and send messages. All processing is local, prioritizing user privacy.

Development

Blazing Fast Windows 9x Installation: 90 Seconds or Less!

2025-03-31

Tired of slow Windows 98 installations? Windows 9x QuickInstall is here! This project leverages Linux and a custom installer to package an already installed Windows 98 system into an optimized ISO image, enabling installation in roughly 60-90 seconds. It supports Windows 98, Windows 98 SE, and Windows ME, and allows for flexible addition of drivers and tools. While building the ISO is more involved than using nLite, the increased customizability makes it perfect for users who prioritize speed and highly tailored installations.

Ambermoon Advanced: A Massive RPG Expansion Released in Episodes

2025-03-31

Indie developer Pyrdacor is releasing Ambermoon Advanced, an unofficial expansion for the RPG Ambermoon, in episodic installments. Five episodes are planned, with the first three currently available, featuring content like "Mysteries of the Sea" and "Elemental Creatures." The game runs on Amiga and modern systems, though the Amiga version is still under development. The developer notes that the game is a work in progress and may contain bugs and imbalances. A physical manual is also planned, containing game information and Ambermoon lore.

Blue95: A Nostalgic 90s Desktop Experience

2025-03-30

Blue95 offers a lightweight desktop experience reminiscent of the 1990s, built on Fedora Atomic Xfce with the Chicago95 theme. While the included installer is still in alpha, the recommended installation method is rebasing from another Fedora Atomic desktop, such as winblues/vauxite. Blue95 prioritizes usability and accessibility over exact replication of older operating systems, incorporating improvements from Universal Blue (codecs, automatic updates, etc.) for a better out-of-the-box experience. A live ISO is also available for testing without installation.

Development Retro Desktop

rr Debugger Gets a Major Update: Software Counters Mode

2025-03-30

The rr debugger now features a Software Counters mode, enabling it to run without access to CPU hardware performance counters, overcoming limitations in cloud VMs and containers. This mode leverages lightweight dynamic and static instrumentation to achieve record and replay functionality, simplifying debugging workflows. The author's blog post details the principles of record and replay technology, highlights the advantages of the rr debugger, and shares their development journey.

Development software counters

PG-MCP: Enhanced PostgreSQL Model Context Protocol Server

2025-03-30

PG-MCP is a Model Context Protocol (MCP) server for PostgreSQL databases, offering enhanced capabilities for AI agents. Building upon the reference Postgres MCP implementation, it adds multi-database support, rich metadata including PostGIS and pgvector extensions, query plan analysis tools, and robust connection management. PG-MCP can be deployed via Docker or a virtual environment. Developers can interact with it using the provided Python client, and can even use large language models like Claude for natural language to SQL conversion.

Development

msgpack23: A Modern C++ MessagePack Library

2025-03-30

msgpack23 is a lightweight, header-only C++ library for serializing and deserializing data to the MessagePack format. Leveraging modern C++ features (C++20 and beyond), it offers a flexible, zero-dependency solution supporting various data types including STL containers, time points, and custom types. Its simple API, performance focus, and extensibility make it a powerful tool for efficient data handling.

Development

xan: A Blazing-Fast CLI Tool for CSV Processing

2025-03-29

xan is a command-line tool built in Rust for lightning-fast processing of massive CSV files (gigabytes!). Leveraging multithreading for parallelism, it easily handles tasks like previewing, filtering, slicing, aggregating, sorting, and joining CSV data. xan boasts a powerful expression language surpassing the speed of Python, Lua, or JavaScript for complex operations. Originally forked from xsv but extensively rewritten, xan caters to social science data analysis needs, including lexicometry, graph theory, and even web scraping. Installation is simple via cargo, Homebrew, pacman, Nix, or pre-built binaries.

Development CSV processing

JobHuntr: Automate Your LinkedIn Job Applications with AI

2025-03-29

JobHuntr.fyi is a macOS desktop app that uses Ollama-powered AI to apply for jobs on LinkedIn automatically, 24/7, without needing an OpenAI API key. It targets jobs with the "Easy Apply" option (around 70% of listings) and handles up to 25 applications daily. The AI fills out applications based on your resume and FAQ, carefully verifying information. When it is unsure how to answer a question, it saves the question to your FAQ and skips that application. It is designed to mimic human behavior and runs from your own IP address, which is intended to minimize the risk of LinkedIn detecting automation.

Development Job Applications

LVGL: A Lightweight and Versatile Embedded Graphics Library

2025-03-29

LVGL is a popular open-source embedded graphics library for creating beautiful UIs for any MCU, MPU, and display type. It boasts 30+ built-in widgets, a powerful style system, web-inspired layout managers, and supports multiple languages. Requiring minimal resources (32kB RAM and 128kB Flash), LVGL simplifies UI development. The LVGL team also offers design, implementation, and consulting services to assist developers in building high-quality GUIs.

RTABench: A New Benchmark for Real-Time Analytics Applications

2025-03-29

Traditional analytics benchmarks often overlook the needs of real-time applications, such as generating fast, targeted insights for specific users, devices, or transactions. RTABench addresses this gap by providing a benchmark that accurately reflects real-time analytics within applications, using a normalized schema, realistic dataset sizes, and queries that match real-world usage patterns. It includes 33 queries covering raw event queries, selective filtering, multi-table joins, and pre-aggregated queries to assess database performance on normalized schemas, selective filtering, and incremental materialized views. RTABench supports multiple databases and welcomes community contributions to expand its database support and optimizations.

vramfs: Turn Your GPU VRAM into a Filesystem

2025-03-29

vramfs is a utility that uses the FUSE library to create a filesystem in your GPU's VRAM. It works like a ramdisk but is backed by video RAM. It is not meant for high-intensity use, yet it works surprisingly well on consumer GPUs with 4GB+ VRAM. The developer achieved ~2.4 GB/s read and 2.0 GB/s write speeds, about one third of a ramdisk's throughput. It uses OpenCL for memory management and FUSE for simplified development. Future improvements aim to reach PCI-e bandwidth limits.

Development Filesystem

Learn Japanese Grammar with TypeScript: Introducing Typed Japanese

2025-03-29

Typed Japanese is a TypeScript type-level library that allows you to express complete Japanese sentences using the type system. It creates a domain-specific language (DSL) based on Japanese grammar rules, so that the TypeScript compiler can verify whether a sentence is grammatically correct. The project also explores an intermediate format for AI in language learning, potentially replacing JSON with a type-checked representation for improved accuracy. It supports various verb and adjective conjugations as well as phrase and sentence construction, with the aim of building a type system for learning and verifying Japanese grammar. While still in early stages and relying on LLM-generated rules, it offers a unique approach to language learning and grammar verification.

Real-Time Chess: A Physical Board That Eliminates Turns

2025-03-29

Tired of the long waits in turn-based chess? A developer has created a real-time physical chessboard that eliminates turns entirely. Each piece has an individual cooldown, enforced by electronics and electromagnets, preventing cheating. The project's PCB designs and firmware are open-source, but the author notes issues like inadequate power distribution and tight tolerances.

Hardware

bknd: A Lightweight, Full-Featured Backend for Simplified App Development

2025-03-29

bknd simplifies app development by providing a fully functional backend for database management, authentication, media, and workflows. Built on web standards, it's lightweight and deployable nearly anywhere, including within your framework of choice. No more deploying multiple separate services! It offers a REST API, React SDK, and React elements for authentication and media components, along with an easy-to-use admin UI. Because it is still under active development (pre-v1.0.0), full backward compatibility isn't yet guaranteed.

Development

ESP32 Pomodoro Timer: A Hardware and Software DIY Project

2025-03-29

A software engineer, driven by a quest for productivity enhancement and a desire to help a friend, designed an ESP32-based Pomodoro timer. This timer uses an e-paper display and a rotary encoder, prioritizing a physical device, fun, and intuitive usability. The project encompasses hardware selection (ESP32, e-paper display, rotary encoder, LED, etc.), software development (C++, Arduino framework, GxEPD2 library), and 3D-printed case design. The author shares lessons learned throughout the design process, such as display selection, input method design, LED indicator implementation, and software code management. The final product combines practicality and fun, incorporating Easter eggs.

PCIe Endpoint on Xilinx 7-Series FPGAs: Open-Source Implementation

2025-03-29

This project implements a PCIe endpoint on Xilinx 7-series FPGAs using the PCIE_2_1 hard block and GTP transceivers. It avoids proprietary Vivado IP cores and is compatible with openXC7. The design includes clock generation, GTP transceivers, and the PCIE_2_1 hard block, supporting PCIe Gen1 x1 and Gen2 x1. It's been tested on Alinx AC7100B SoM and Wavelet uSDR. Docker build and run scripts are provided, along with MSI interrupt and kernel driver support. This project is funded by NGI0 Entrust.

Hardware

Postgres Language Server: A Powerful SQL Toolchain for Developers

2025-03-29

This project offers a comprehensive toolchain for Postgres development, built upon Postgres' own parser (libpg_query) for guaranteed 100% syntax compatibility. Employing a server-client architecture with transport-agnostic design, it provides access via LSP, CLI, HTTP APIs, and WebAssembly. Current features include autocompletion, syntax highlighting, type checking (using EXPLAIN insights), and a linter. Future development focuses on enhancing these core features and building a robust infrastructure. Contributions are welcome!

Development Developer tools

Self-Hosting a Nearly Free, Open Source TURN Server on Oracle Cloud

2025-03-28

This guide details setting up a nearly free, open-source TURN server on Oracle Cloud's free tier, offering an alternative to WebRTC implementations relying on third-party services. It walks through creating subnets, security lists, instance configurations, installing Nginx, Certbot, a PeerJS server, and a Coturn server. The guide covers configuring iptables rules and using coturn-credential-api for authentication, providing complete control over WebRTC communication and enhancing privacy and security by avoiding reliance on Google or other large providers.
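
As a rough illustration of how credential-based TURN authentication commonly works, the sketch below generates time-limited credentials in the style of Coturn's shared-secret mode (`use-auth-secret`): the username embeds an expiry timestamp and the password is a Base64-encoded HMAC-SHA1 of that username. It assumes the guide's credential API follows this scheme, which the summary does not confirm; the function name, parameters, and secret are hypothetical.

```python
import base64
import hashlib
import hmac
import time
from typing import Optional, Tuple


def turn_credentials(shared_secret: str, user_id: str,
                     ttl_seconds: int = 3600,
                     now: Optional[float] = None) -> Tuple[str, str]:
    """Return a (username, password) pair valid for `ttl_seconds`.

    `shared_secret` must match the secret configured on the TURN server.
    `now` exists only to make the function testable (assumption).
    """
    current = time.time() if now is None else now
    expiry = int(current + ttl_seconds)
    # The username carries the Unix expiry time, then a colon, then the user id.
    username = f"{expiry}:{user_id}"
    digest = hmac.new(shared_secret.encode("utf-8"),
                      username.encode("utf-8"),
                      hashlib.sha1).digest()
    password = base64.b64encode(digest).decode("ascii")
    return username, password
```

A signalling server would hand the resulting pair to the browser, which passes it as `username` and `credential` in its WebRTC ICE server configuration. The shared secret itself never leaves the server.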

Hexi: A Lightweight C++23 Library for Safe Binary Data Handling

2025-03-28

Hexi is a lightweight, header-only C++23 library for safely handling binary data from arbitrary sources (primarily network data). It bridges the gap between manually memcpying bytes and full-blown serialization libraries. Designed for ease of use, safety with untrusted data, flexibility, and minimal overhead, Hexi supports custom containers (including non-contiguous ones), exception handling, and bounds checking. Additional features include buffer types for binary files, static/dynamic buffers, and a thread-local block allocator.

Development Binary Data

A Tiny Forth for the 6502: Under 600 Bytes

2025-03-28

This article details a highly minimized Forth implementation for the 8-bit 6502 CPU, achieving a size of under 600 bytes. The author compares two interpreter models: Direct Threaded Code (DTC) and Minimal Threaded Code (MTC), opting for DTC for its smaller size. The project focuses on size over performance, aiming to verify standard DTC against MTC variations. The resulting Forth includes core primitives and is tested with `my_hello_world.FORTH`, demonstrating functionality.

Development