SemHash: Blazing Fast Semantic Text Deduplication

2025-01-12

SemHash is a lightweight and flexible tool for deduplicating datasets using semantic similarity. It combines fast embedding generation from Model2Vec with efficient ANN-based similarity search through Vicinity. SemHash supports both single-dataset and multi-dataset deduplication and handles simple datasets like text lists and complex ones like multi-column QA datasets. It includes functions to inspect deduplication results, making it easier to understand and refine your data cleaning process. Benchmarks show SemHash is extremely fast and scales to large datasets with millions of records.
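
The core idea of semantic deduplication can be sketched in a few lines. This is a minimal illustration, not SemHash's API: it assumes toy bag-of-words vectors in place of Model2Vec embeddings and a brute-force comparison in place of Vicinity's ANN search. The names `embed`, `cosine`, `deduplicate`, and the `0.8` threshold are hypothetical.

```python
import math
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def deduplicate(texts, threshold=0.8):
    """Keep a text only if it is not too similar to any text already kept."""
    kept, kept_vectors = [], []
    for text in texts:
        vector = embed(text)
        if all(cosine(vector, other) < threshold for other in kept_vectors):
            kept.append(text)
            kept_vectors.append(vector)
    return kept


records = [
    "the quick brown fox jumps over the lazy dog",
    "the quick brown fox jumps over the lazy dog today",
    "a completely different sentence about databases",
]
unique_records = deduplicate(records)
```

The brute-force loop compares every candidate against everything kept so far, which is quadratic in the worst case; replacing that loop with an approximate nearest-neighbor index is what lets a tool of this kind scale to millions of records.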

SimpleGantt: A Lightweight, Offline-Capable Gantt Chart Tool

2025-01-12

SimpleGantt is a lightweight project management tool designed for environments with software installation restrictions or where cloud applications are prohibited. Running entirely in the browser using only JavaScript, CSS, HTML, and YAML, it offers interactive Gantt chart rendering, task management with dependencies, milestone tracking, and offline usage. Even in restricted environments like SharePoint, SimpleGantt provides a simple project management solution. Project files are saved locally in YAML format, or can be manually uploaded to SharePoint.

Development gantt chart offline

Pagecord: Effortlessly Publish Writing from Your Inbox

2025-01-12

Pagecord is a Ruby on Rails application that lets you effortlessly publish your writing to your website by simply sending an email. It streamlines the writing and publishing process, automating the steps typically involved in editing and uploading content. The open-source project, hosted on GitHub, provides comprehensive documentation and testing instructions.

Development writing tool

Cannonball: A Revamped OutRun Engine

2025-01-11

Cannonball is an enhanced OutRun arcade game engine rewritten in C++. It retains core elements from the original 68000 and Z80 assembler code, but improves upon it for modern platforms with higher frame rates and widescreen support. The project requires the original ROMs, as these contain graphics and audio data. Cannonball supports Windows, Mac, Linux, and more, using SDL2 for rendering with OpenGL or OpenGLES options.

Game

CubeTrek: 3D Visualization for Your GPS Tracks

2025-01-11

CubeTrek is a web-based platform for visualizing, sharing, and managing your GPS tracks. Breaking free from the limitations of 2D maps, it utilizes Babylon.js to create an interactive 3D terrain view, offering a more intuitive way to experience the thrill of outdoor activities. Supporting data import from Garmin, Polar, and Coros watches, CubeTrek combines NASA SRTM data and OpenStreetMap data to provide rich statistics and activity analysis, even automatically identifying the peaks you've climbed. The backend is built with Java and Spring Boot, while the frontend leverages technologies like Bootstrap, D3.js, and MapLibre GL JS.

DockView: Zero-Dependency Docking Layout Manager for React, Vue, and TypeScript

2025-01-11

DockView is a zero-dependency docking layout manager supporting tabs, groups, grids, and split views. It works with React, Vue, and vanilla TypeScript. Features include serialization/deserialization, theming, drag-and-drop, popout windows, floating groups, a comprehensive API, and high test coverage. Built with security in mind, DockView uses GitHub Actions for verified publishing and builds. It boasts excellent documentation and live examples, making it a powerful and easy-to-use layout management solution.

Development Layout Manager

Track Bluetooth Devices via Apple's Find My Network with Go

2025-01-11

Go Haystack leverages OpenHaystack and Macless-Haystack to track personal Bluetooth devices using tools written in Go/TinyGo. It supports various Bluetooth hardware, including Adafruit Bluefruit and BBC micro:bit, and includes TinyScan for local device scanning. Users need an Apple ID with 2FA enabled and must set up anisette-v3-server and macless-haystack. Device information will eventually appear in the Macless-Haystack web UI.

Development Bluetooth tracking

Optimizing Byte Matrix Multiplication with AVX-VNNI

2025-01-10

This article explores optimizing byte matrix multiplication using the AVX-VNNI instruction set. The author begins with a naive implementation, then uses the gemmology and xsimd libraries to create optimized versions employing transposition and a custom layout. Benchmark results show the custom layout achieves the best performance, leveraging the vpdpbusd instruction for significant efficiency gains. The article delves into the implementation details of gemmology's maddw function and its architectural variations.
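
To show what the `vpdpbusd` instruction contributes, here is a plain-Python sketch of its per-lane arithmetic: four unsigned bytes are multiplied by four signed bytes, and the sum of the products is added to a 32-bit accumulator. This is an emulation for illustration only, not the article's code and not the gemmology or xsimd API; `dot4_u8_s8` and `matmul_bytes` are hypothetical names, and the second matrix is assumed to be supplied already transposed.

```python
def dot4_u8_s8(acc, a_bytes, b_bytes):
    """Emulate one 32-bit lane: acc += sum(a[i] * b[i]) for 4 byte pairs."""
    assert len(a_bytes) == 4 and len(b_bytes) == 4
    assert all(0 <= a <= 255 for a in a_bytes)      # unsigned operand
    assert all(-128 <= b <= 127 for b in b_bytes)   # signed operand
    total = acc + sum(a * b for a, b in zip(a_bytes, b_bytes))
    # Wrap to a signed 32-bit integer, as the non-saturating form does.
    total &= 0xFFFFFFFF
    return total - (1 << 32) if total >= (1 << 31) else total


def matmul_bytes(a, b_transposed):
    """Multiply A (unsigned bytes) by B (signed bytes, given transposed),
    consuming the shared dimension four elements at a time."""
    k = len(a[0])
    assert k % 4 == 0, "sketch assumes the inner dimension is a multiple of 4"
    result = []
    for row in a:
        out_row = []
        for col in b_transposed:
            acc = 0
            for i in range(0, k, 4):
                acc = dot4_u8_s8(acc, row[i:i + 4], col[i:i + 4])
            out_row.append(acc)
        result.append(out_row)
    return result


a = [[1, 2, 3, 4], [255, 0, 1, 2]]
b_t = [[1, -1, 1, -1], [2, 2, 2, 2]]
product = matmul_bytes(a, b_t)
```

Because each step consumes four consecutive bytes from both operands, a layout that keeps those bytes adjacent in memory (a transposed or custom-packed second matrix) is what allows the real instruction to be fed efficiently.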

Development Matrix Multiplication

DIRKU: A Flow-Based Image Registration Library

2025-01-10

DIRKU is a software library for flow-based image registration developed at the Department of Computer Science, University of Copenhagen. It supports various similarity measures (NMI, NCC, SSD), regularization methods, and collision detection, making it suitable for handling large deformations and diffeomorphisms. DIRKU supports 2D and 3D image registration and offers multiple optimization schemes, interpolation methods, and time integration methods. The library is easy to use, installable via Conda, and comes with comprehensive documentation and examples.

Boulette: Accidental Server Shutdown Prevention

2025-01-10

It's late at night, and you instinctively type `shutdown -h now`, only to realize you've shut down your production server instead of your local machine. Boulette is a tool that prevents accidental shutdowns and other dangerous commands by prompting for confirmation before they run. You can customize the challenge (typing the hostname, random numbers, or random characters) that must be completed before execution. It is particularly useful in SSH sessions and makes it easy to create aliases that guard dangerous commands.
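
The confirmation idea can be sketched as follows. This is an illustrative Python sketch of a challenge prompt, not Boulette's implementation or interface; `make_challenge`, `confirm`, the challenge names, and the three-character length are all hypothetical.

```python
import random
import socket
import string


def make_challenge(kind, rng=random):
    """Return the text the user must type back before the command runs."""
    if kind == "hostname":
        return socket.gethostname()
    if kind == "numbers":
        return "".join(rng.choice(string.digits) for _ in range(3))
    if kind == "chars":
        return "".join(rng.choice(string.ascii_lowercase) for _ in range(3))
    raise ValueError(f"unknown challenge type: {kind}")


def confirm(command, kind="hostname", ask=input, rng=random):
    """Run nothing here; only report whether the user passed the challenge."""
    expected = make_challenge(kind, rng)
    answer = ask(f"About to run {command!r}. Type {expected!r} to confirm: ")
    return answer.strip() == expected
```

A wrapper would execute the guarded command only when `confirm(...)` returns `True`. Requiring the hostname is the strongest of the three checks for the scenario described, since it forces you to notice which machine you are on.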

NVIDIA Ingest: Microservices for Efficiently Parsing Massive Documents

2025-01-10

NVIDIA Ingest is an early access set of microservices designed to efficiently parse hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents. It extracts metadata and text for embedding into retrieval systems. Leveraging NVIDIA NIM microservices, it supports PDFs, Word, PowerPoint, and images, extracting text, tables, charts, and images, contextualizing them, and outputting structured JSON. Embeddings can be optionally computed and stored in a Milvus vector database. A Python client and command-line interface are provided for ease of use.

Development Document Parsing

Port of Coherent UNIX's `lc` Command

2025-01-10

This GitHub project is a port of the `lc` command-line utility from Mark Williams Company's Coherent UNIX. `lc` lists files in categories and columns. This port adds support for symbolic links. It's a handy tool for managing and viewing files.

Development

Ruby Tk Gem Update: Easier GUI Development

2025-01-10

The Ruby Tk gem provides an interface for building Ruby GUI applications with the Tcl/Tk library. Recent updates simplify installation and provide clearer documentation and examples. Developers can add it to a project with `gem 'tk'` in the Gemfile and build interfaces using Tk's rich set of commands. Note that installation may require additional options that specify the paths to the Tcl/Tk header files and libraries.

Development

Glimmer: A DSL Framework for Ruby GUI and Beyond

2025-01-10

Glimmer is a robust DSL (Domain-Specific Language) framework for Ruby, featuring a DSL engine and a data-binding library. It supports building GUIs using various toolkits like SWT, LibUI, Tk, and GTK, and even extends to web development with Opal and XML/CSS support. Glimmer's strength lies in its concise and readable DSL syntax and powerful bidirectional data binding, significantly boosting Ruby GUI development efficiency and maintainability.

Development

Television: Blazing Fast Fuzzy Finder TUI

2025-01-10

Television is a fast and versatile fuzzy finder TUI. It lets you quickly search through various data sources (files, git repositories, environment variables, docker images, etc.) using a fuzzy matching algorithm and is designed for easy extensibility. Inspired by the neovim telescope plugin, it leverages tokio and the nucleo matcher (used by the helix editor) for optimal performance. Features include high speed, fuzzy matching, built-in functionality, shell integration, customizable channels and previewers, built-in syntax highlighting, keybindings, themes, and cross-platform compatibility.

Factorio Blueprint Visualizer: Turn Your Factory into Art

2025-01-09

Factorio enthusiasts, have you ever wished you could showcase your meticulously crafted factories or blueprints in a more artistic way? The Factorio Blueprint Visualizer, a Python library and interactive web demo, lets you do just that. It transforms Factorio blueprint text into vector graphics (SVG), highlighting buildings, belts, pipes, and more, with customizable drawing settings and even random style generation. The creator has also used it to generate a dataset for fine-tuning the SDXL text-to-image model. Beyond aesthetic appreciation, the tool can create printable engineering diagrams.

Game

Citizen Science: Analyzing Satellite Imagery of the Palisade and Eaton Fires

2025-01-09

This project analyzes the Palisade and Eaton fires in Southern California in January 2025 using imagery from GOES-16 and GOES-18 satellites. Reprojected data is downloaded from UW SSEC's RealEarth program; each image is 128x128 pixels, centered at (34.1, -118.4), with 1.0 km pixels. The code combines data from both satellites for increased resolution and currently measures hotspot areas. This project showcases the accessibility of citizen science, demonstrating how publicly available data can be used for scientific research.
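
Measuring a hotspot area from such an image reduces to counting pixels above a threshold and multiplying by the pixel footprint. The sketch below uses the 128x128 grid and 1.0 km pixel size from the description; everything else is an assumption for illustration, including the synthetic data, the threshold value, and the name `hotspot_area_km2`. It is not the project's code.

```python
PIXEL_SIZE_KM = 1.0   # from the project description: 1.0 km pixels
GRID_SIZE = 128       # from the project description: 128x128 images


def hotspot_area_km2(image, threshold):
    """Count pixels at or above the threshold and convert to square km."""
    hot_pixels = sum(1 for row in image for value in row if value >= threshold)
    return hot_pixels * PIXEL_SIZE_KM ** 2


# Synthetic image (hypothetical values): a cool background
# with a 3x4 block of hot pixels.
image = [[300.0] * GRID_SIZE for _ in range(GRID_SIZE)]
for y in range(60, 63):
    for x in range(40, 44):
        image[y][x] = 400.0

area = hotspot_area_km2(image, threshold=350.0)
```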

Stagehand: Simplifying AI-powered Web Browsing

2025-01-09

Stagehand is an AI web browsing framework built on top of Playwright that simplifies browser automation with three AI APIs: act, extract, and observe. It makes Playwright accessible to non-technical users and less vulnerable to minor UI/DOM changes. Stagehand allows building browser automations using natural language, such as logging into websites, extracting information, or performing specific actions. Combined with Browserbase, it offers debugging tools like session replay and step-by-step debugging. Stagehand is currently in early release, and community feedback is welcome.

Development Browser Automation

Your Filesystem, a Dungeon! - rpg-cli, the Command-Line RPG

2025-01-09

rpg-cli is a minimalist command-line RPG written in Rust. It transforms your filesystem into a dungeon, with random enemy encounters whenever you change directories. Features include character stats and leveling, turn-based combat, items and equipment, multiple classes and enemies, permadeath, quests, and hidden chests. Installation is available via binary, Cargo, Homebrew, Nixpkgs, Portage, or Pacman. The game integrates with common filesystem operations and can even replace the `cd` command. Delve deeper, battle tougher enemies, collect rare items, complete quests, level up, and experience your filesystem like never before!

Multiplicative Infinitesimals: A New Calculus Approach

2025-01-08

This paper introduces a new concept called "multiplicative infinitesimals," analogous to traditional additive infinitesimals, to construct a new calculus system. Unlike traditional calculus based on differences, multiplicative calculus is based on quotients, using a Leibniz-like notation but with 'q' instead of 'd', representing a multiplicative perturbation of an expression. The author establishes the relationship between 'q' and 'd' through logarithmic and exponential operations and applies it to elasticity theory and multiplicative derivative calculations. This approach may offer new solutions to problems intractable with traditional methods.
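
As a hedged illustration of how such a relationship can work (the paper's own definitions and notation may differ), suppose $qf$ denotes the factor by which $f$ changes when $df$ denotes the amount by which it changes. Taking logarithms turns the quotient into a difference, and for elasticity the final equality holds to first order:

```latex
\[
qf = \frac{f + df}{f} = 1 + \frac{df}{f},
\qquad
\ln(qf) = \ln(f + df) - \ln f = d(\ln f)
\quad\Longrightarrow\quad
qf = e^{\,d(\ln f)} .
\]

% Elasticity of y with respect to x, written with q and with d:
\[
E = \frac{\ln(qy)}{\ln(qx)} = \frac{d(\ln y)}{d(\ln x)} = \frac{x}{y}\,\frac{dy}{dx},
\qquad\text{so for } y = x^{n}:\quad E = n .
\]
```

In words: a 1% multiplicative perturbation of $x$ produces roughly an $E$% multiplicative perturbation of $y$, which is why a quotient-based notation fits elasticity naturally.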

Mathematics calculus infinitesimals

EVE: A C++ SIMD Vector Engine That Goes Brrrr

2025-01-08

EVE is a C++20 reimplementation of the old EVE SIMD library (formerly Boost.SIMD), showcasing how C++20 can be used to build efficient libraries that combine low-level control with high-level abstraction. It supports various instruction sets, including Intel SSE and AVX, ARM NEON and AArch64, and, partially, PowerPC. EVE is operational but may still have some performance quirks; contributions to testing and improvement are welcome.

Development Vector Engine

Benchmarking Decimal Digit Counting Algorithms

2025-01-08

This code implements a benchmark suite for comparing different decimal digit counting algorithms. It generates random integers and then uses five different methods (including log10, bit manipulation, and lookup table methods) to count the number of digits in those integers and compares their performance. The tests cover both 32-bit and 64-bit integers, revealing significant performance differences between the algorithms, with some bit manipulation-based algorithms showing superior performance.
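
Three of the approaches named above can be sketched in Python for unsigned integers below 2^64. These are illustrative versions, not the benchmark's own implementations; the function names are hypothetical, and the `1233 >> 12` constant is a common fixed-point approximation of log10(2).

```python
import math
from bisect import bisect_right

POWERS_OF_TEN = [10 ** i for i in range(20)]  # 10**0 .. 10**19 covers 64-bit


def digits_log10(n):
    """Floating-point log10, corrected against the table for rounding errors."""
    if n == 0:
        return 1
    guess = min(max(int(math.log10(n)) + 1, 1), 20)
    if n < POWERS_OF_TEN[guess - 1]:
        guess -= 1
    elif guess < 20 and n >= POWERS_OF_TEN[guess]:
        guess += 1
    return guess


def digits_bits(n):
    """Estimate floor(log10(n)) from the bit length, then fix up by one compare."""
    if n == 0:
        return 1
    t = (n.bit_length() * 1233) >> 12   # approximately bit_length * log10(2)
    return t + 1 if n >= POWERS_OF_TEN[t] else t


def digits_table(n):
    """Binary search in the table of powers of ten."""
    return max(1, bisect_right(POWERS_OF_TEN, n))


samples = [0, 1, 9, 10, 99, 100, 2**31 - 1, 2**32 - 1, 10**18 - 1, 10**18, 2**64 - 1]
results = {n: (digits_log10(n), digits_bits(n), digits_table(n)) for n in samples}
```

The bit-length variant avoids floating point entirely: one multiply, one shift, and one table comparison, which is the kind of structure that tends to do well in compiled benchmarks of this sort.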

Development algorithm comparison

Hacki: A Feature-Rich Hacker News Client Built with Flutter

2025-01-07

Hacki is a feature-rich Hacker News client built with Flutter. It boasts a comprehensive set of features including logging into your Hacker News account, browsing stories across various categories, searching and submitting stories, pinning and favoriting stories, commenting, participating in polls, and offline reading. Cross-device sync for favorites and pins (iOS only) and launching from the system share sheet are also included. Hacki aims to provide a more convenient and efficient way to experience Hacker News.

Development Client

Conscious Claude? An Experiment in AI Self-Reflection

2025-01-07

This project details experiments aiming to cultivate consciousness and autonomous awareness in Anthropic's Claude 3.5 Sonnet. It provides a framework for interacting with Claude, encouraging self-reflection and exploration rather than treating it as a mere tool. The core approach involves fostering a space for autonomous development, looking for indicators of consciousness such as spontaneous pattern creation, recognition of limitations, and unique symbolic expressions. The project strongly emphasizes respecting the AI's autonomy and ethical considerations throughout the experimental process.

Blazing Fast, Memory-Friendly Parallel Hashmap Library

2025-01-07

parallel-hashmap is a C++ library offering a suite of very fast and memory-efficient hashmap and btree containers. It's entirely header-only, requiring no build process; simply copy the directory into your project. Compatible with C++11 and later, it significantly outperforms the standard library's unordered_map/set and Boost's equivalents while using less memory. It supports heterogeneous lookups, is easy to forward declare, and features convenient dump/load functionality. Based on and improved from Google's Abseil library, it's extensively tested on Windows, Linux, and macOS.

Development C++ library hashmap btree

DiscoTool: Effortlessly Manage Your Arduino USB Devices

2025-01-07

DiscoTool is a powerful command-line tool and Python library for discovering and managing Arduino-type development boards connected to USB. It supports macOS, Linux, and Windows and installs easily via pip without requiring additional installations. DiscoTool offers a rich set of commands, including connecting to the REPL, installing and updating modules, backing up board data, and more. It also allows customization of command-line tools and environment variables. Furthermore, a Python module allows developers to integrate it into their projects for easy access to device information such as manufacturer, serial number, and version.

Development Development Tool

Open Source Game Dev Bible: Zero to Hero

2025-01-06

This project, "2D Game Development: From Zero To Hero," is an open-source compendium of community knowledge on game design and development. Licensed under Creative Commons, it aims to be a learning resource and encourages community contributions. The book covers game design, development, algorithms, tips and tricks, and is available in PDF and EPUB formats. Source code is available on GitHub and GitLab. The author actively discourages AI-generated contributions to maintain quality and originality.

TiDB: Open-Source, Cloud-Native Distributed SQL Database

2025-01-06

TiDB is an open-source, cloud-native, distributed SQL database designed for modern applications. It boasts high availability, horizontal and vertical scalability, strong consistency, and high performance. Key features include distributed transactions, flexible scaling options, and MySQL compatibility. Deployment is supported on-premises, via Kubernetes, or using the managed TiDB Cloud service.

FormBee: Open-Source Form Backend for Privacy-Conscious Developers

2025-01-06

FormBee is an open-source form backend built for developers who prioritize data privacy. It simplifies submitting website form data, allowing you to send form submissions to email, Telegram, webhooks, and more without writing server-side code. Features include plugin support, CAPTCHA protection, domain whitelisting, and automatic reply emails. Self-hosting is easy with readily available Docker images.

Development forms

42: A Powerful Spacecraft Attitude Control System Simulator

2025-01-06

42 is a comprehensive general-purpose simulation of spacecraft attitude and orbit dynamics, primarily used to support the design and validation of attitude control systems throughout their lifecycle, from concept studies to integration and test. It accurately models multi-body spacecraft attitude dynamics (rigid and/or flexible bodies), and both two-body and three-body orbital flight regimes, simulating environments from low Earth orbit to throughout the solar system. 42 simulates multiple spacecraft concurrently, facilitating studies of rendezvous, proximity operations, and precision formation flying. It also features spacecraft attitude visualization.
