DeepSeek-R1: A Reasoning Model Trained with Reinforcement Learning, No Supervised Fine-tuning Needed

2025-01-20
DeepSeek-R1: A Reasoning Model Trained with Reinforcement Learning, No Supervised Fine-tuning Needed

The DeepSeek team open-sourced its first-generation reasoning models, DeepSeek-R1, and a suite of distilled models. DeepSeek-R1-Zero, trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT), demonstrates remarkable reasoning capabilities, though it has some flaws. DeepSeek-R1 addresses these issues by incorporating cold-start data before RL, achieving performance comparable to OpenAI-o1. Six distilled models based on Llama and Qwen are also open-sourced, with DeepSeek-R1-Distill-Qwen-32B outperforming OpenAI-o1-mini on various benchmarks. The project supports commercial use and provides an online chat website and an OpenAI-compatible API.

Read more

Running DOOM in Microsoft Word: A VBA Hack That's Actually Impressive

2025-01-20
Running DOOM in Microsoft Word: A VBA Hack That's Actually Impressive

A developer has ported the classic game DOOM into a Microsoft Word document! Using VBA macros and base64 encoding, they embedded the DOOM engine and game data within a Word document, rendering the game in real-time by reading keyboard input. While lacking sound, this unusual project showcases the power of VBA and the developer's creativity, prompting reflection on security and the exploration of software boundaries.

Read more
Game

Rhai: A Safe and Efficient Embedded Scripting Language for Rust

2025-01-20
Rhai: A Safe and Efficient Embedded Scripting Language for Rust

Rhai is an embedded scripting language and evaluation engine for Rust, providing a safe and easy way to add scripting capabilities to any application. It targets all CPU and OS architectures supported by Rust, including WebAssembly, and features a simple, JavaScript-like syntax. Rhai boasts efficient evaluation and tight integration with native Rust functions and types, allowing seamless passing and use of Rust values. Its robust security features protect against malicious attacks such as stack overflows and runaway scripts.

Read more

Using Your Apple Device as an Access Card: A Clever Hack Using a Chinese Transit Card

2025-01-19
Using Your Apple Device as an Access Card: A Clever Hack Using a Chinese Transit Card

Many have tried using their Apple device as an access card, but the closed nature of NFC and Wallet ecosystems makes this difficult. However, a Chinese transit card called "China T-Union," officially supported by Apple Wallet, offers a clever workaround. Its unique properties – a non-randomizing UID and unchanging serial number across devices – allow it to be recognized by some UID-based access control systems. While UID authentication is less secure, some systems support it as a fallback. Obtaining the card requires an Alipay account and a biometric travel document. The method is slightly convoluted, but it provides a viable solution for access systems supporting UID authentication.

Read more

Linux Network Programming Guide: A Deep Dive into Socket Programming

2025-01-19
Linux Network Programming Guide: A Deep Dive into Socket Programming

This guide provides a comprehensive explanation of Linux network programming, focusing on socket programming. The author notes that many online resources lack clarity and sample codes often only cover the basics, hence the creation of this tutorial, offering clear guidelines and numerous examples. Topics covered include socket types, addressing, APIs (getprotobyname(), getservbyname(), getaddrinfo(), htonl(), htons(), ntohl(), ntohs(), socket(), setsockopt(), bind(), listen(), accept(), connect(), recv(), send(), close()), client-server models (simple HTTP client, TCP-based client-server, multithreaded TCP client-server, UDP-based client-server), advanced techniques (non-blocking sockets, synchronous I/O multiplexing with select() and poll(), broadcasting messages), and secure networking with libcurl and OpenSSL.

Read more

Yek: A Fast Tool for Chunking Repository Files for LLMs

2025-01-19
Yek: A Fast Tool for Chunking Repository Files for LLMs

Yek is a fast Rust-based tool designed to read text-based files from a repository or directory, chunk them, and serialize them for Large Language Model (LLM) consumption. It intelligently skips unwanted files (using .gitignore rules and Git history), splits content into chunks based on token count or byte size, and offers extensive configuration options such as custom ignore patterns and file priority rules. Yek aims to streamline the data preparation process for LLMs, boosting efficiency.

Read more
Development

BorrowChecker.jl: An Experimental Borrow Checker for Julia

2025-01-18
BorrowChecker.jl: An Experimental Borrow Checker for Julia

BorrowChecker.jl is a Julia package that simulates Rust-like ownership and borrowing semantics through a macro-based system, performing runtime checks during development and testing to flag memory safety issues. It doesn't guarantee absolute memory safety but aids in writing safer code. The library uses macros like `@own`, `@move`, `@clone` to manage object ownership and `@lifetime` blocks to manage reference lifetimes, while also offering an `@managed` block for automatic borrow checking. While not perfect, it's a valuable tool for developers aiming to improve memory safety in their Julia code.

Read more
Development

llmpeg: Generate FFmpeg Commands with an LLM

2025-01-18
llmpeg: Generate FFmpeg Commands with an LLM

llmpeg simplifies using FFmpeg commands via a large language model (LLM). Simply describe your desired video manipulation, and llmpeg generates the corresponding FFmpeg command. For instance, you can easily remove audio from a video. Requires FFmpeg and an OpenAI API key. This project provides a convenient AI-powered tool for video processing.

Read more
Development Video Processing

Dr.TVAM: An Inverse Rendering Framework for Tomographic Volumetric Additive Manufacturing

2025-01-17
Dr.TVAM: An Inverse Rendering Framework for Tomographic Volumetric Additive Manufacturing

Dr.TVAM is an inverse rendering framework for tomographic volumetric additive manufacturing (TVAM) built on the Mitsuba renderer. It leverages physically-based differentiable rendering to optimize patterns for TVAM, supporting scattering printing media, arbitrary vial shapes, and projector motions. An improved discretization scheme for the target shape is also included. A command-line tool and comprehensive documentation are provided for easy customization and optimization.

Read more

C to FlipJump: A Minimalist RISC-V Virtual Machine

2025-01-17
C to FlipJump: A Minimalist RISC-V Virtual Machine

The c2fj project is a compiler that translates C code into FlipJump, a minimalist instruction set virtual machine. It first compiles C code into a RISC-V ELF file, then translates RISC-V instructions into FlipJump instructions, and finally runs the resulting FlipJump program. This process cleverly leverages FlipJump's space optimization, allowing for efficient compilation even of large C codebases. c2fj supports various compilation flags and provides Makefile support for complex projects. It also includes a minimal C library implementing necessary system calls such as `exit`, `getc`, `putc`, and `sbrk`.

Read more
Development

Nonlinear Optics Sandbox: AI-Powered Control of Complex Optical Systems

2025-01-17
Nonlinear Optics Sandbox: AI-Powered Control of Complex Optical Systems

WestonCB's Nonlinear Optics Sandbox is an interactive platform for studying controlled nonlinear dynamics through the simulation of coupled optical fields. The project uses a specific optical configuration—coupled fundamental/harmonic fields interacting with an adaptive medium—as an idealized model to explore how optimization processes can guide complex nonlinear systems towards desired states. It combines real-time visualization, GPU-accelerated physics simulation, and gradient-based optimization, offering a novel tool for researching nonlinear dynamics and the control of complex systems.

Read more

Nyxelf: A Powerful Tool for Analyzing Malicious Linux ELF Binaries

2025-01-17
Nyxelf: A Powerful Tool for Analyzing Malicious Linux ELF Binaries

Nyxelf is a powerful tool designed for analyzing malicious Linux ELF binaries. It combines static analysis techniques using tools like readelf, objdump, and pyelftools with dynamic analysis within a secure QEMU-based sandbox. Features include UPX unpacking, syscall tracing, process/file activity monitoring, and an intuitive GUI powered by pywebview. JSON output supports automated workflows, making Nyxelf ideal for security researchers and reverse engineers.

Read more

Conquering 100 Project Euler Problems in 100 Languages

2025-01-16
Conquering 100 Project Euler Problems in 100 Languages

A programmer spent over a year solving the first 100 Project Euler problems using 100 different programming languages! From common languages like Python and Java to obscure esoteric languages, the sheer dedication and programming prowess is impressive. This project showcases deep understanding of various programming paradigms and offers a valuable learning experience for programmers of all levels.

Read more
Development challenge

Training Large Diffusion Models on a Shoestring Budget: $1890

2025-01-16
Training Large Diffusion Models on a Shoestring Budget: $1890

Sony Research has open-sourced micro_diffusion, demonstrating how to train large-scale diffusion models on an extremely low budget ($1890). Using 37 million publicly available real and synthetic images, they trained a 1.16 billion parameter sparse transformer model, achieving an FID score of 12.7 on zero-shot generation on the COCO dataset. The project provides training code, dataset code, pre-trained model weights, and details a staged training process, including progressive training from low to high resolution and the use of patch masking to reduce training costs and improve efficiency.

Read more

DBOS Transact: Ultra-Lightweight Durable Execution in TypeScript

2025-01-16
DBOS Transact: Ultra-Lightweight Durable Execution in TypeScript

DBOS Transact is a lightweight TypeScript library for durable execution, leveraging Postgres for persistence. This means your program can automatically resume from where it left off, even if interrupted or crashed. It solves common problems like orchestrating long-running workflows, running reliable background jobs, processing events exactly once, and building fault-tolerant distributed task queues. DBOS Transact is easy to use: install it and annotate your program with decorators. It requires no other dependencies and integrates seamlessly into any TypeScript application, like Next.js, without external dependencies beyond Postgres.

Read more
Development durable execution

Open Source: awesome-launch – Boost Your Side Project Visibility

2025-01-16
Open Source: awesome-launch – Boost Your Side Project Visibility

Struggling to get eyes on your side project? KingMenes' awesome-launch is an open-source directory of free and friendly communities designed to help you promote it. It features a curated list of platforms including Reddit, Twitter, LinkedIn, Hacker News, and Product Hunt, categorized for easy navigation. From Indie Hackers and Dev.to to beta testing platforms and crowdfunding sites, awesome-launch provides a comprehensive resource for side project promotion.

Read more

dnSpyEx: The Unofficial Revival of a Powerful .NET Debugger and Assembly Editor

2025-01-15
dnSpyEx: The Unofficial Revival of a Powerful .NET Debugger and Assembly Editor

dnSpyEx is an unofficial continuation of the popular dnSpy project, a debugger and .NET assembly editor. It lets you edit and debug .NET and Unity assemblies even without source code. Key features include debugging and editing .NET and Unity assemblies, light and dark themes, and powerful decompilation, hex editing, and debugging capabilities such as breakpoint setting, variable windows, call stacks, and exception handling. dnSpyEx also offers a public API for extensions and scripting, and supports multiple language translations.

Read more
Development Decompiler

Keon: A Human-Readable Serialization Format Inspired by Rust

2025-01-15
Keon: A Human-Readable Serialization Format Inspired by Rust

Keon is a human-readable object notation (ORN) and serialization format syntactically similar to Rust and fully supporting Serde's data model. It boasts a cleaner syntax, allowing comments and trailing commas, and enabling a near-Rust-like writing experience. Keon distinguishes between tuples and lists, supports arbitrary types as dictionary keys, and offers Base64, Base32, and Base16 support. The goal is a more intuitive, readable, and writable serialization format.

Read more
Development

Ropey: A High-Performance Text Rope for Rust

2025-01-15
Ropey: A High-Performance Text Rope for Rust

Ropey is a UTF-8 text rope library written in Rust, designed as a backing text buffer for applications like text editors. It's fast, robust, and handles massive texts and memory-incoherent edits with ease. Ropey boasts strong Unicode support, predictable performance, and excels at frequent edits on medium-to-large texts. However, it's not ideal for very small texts or those exceeding available memory. Key features include line awareness, efficient rope slices, and flexible low-level APIs. Ropey has proven itself in various projects and undergoes rigorous testing, providing a reliable solution for text manipulation.

Read more
Development Text Editing

Customasm: An Assembler for Your Own Instruction Sets

2025-01-15
Customasm: An Assembler for Your Own Instruction Sets

Customasm is an assembler that lets you define your own custom instruction sets, perfect for testing the bytecode of a new virtual machine or writing programs for that new microprocessor architecture you just implemented on an FPGA chip! Try it online in your browser, check out an example project targeting the NES, and install the VSCode syntax highlighting extension. Install via `cargo install customasm`, download pre-built executables from Releases, or compile from source. Documentation and a how-to-start guide are available in the wiki.

Read more

Pyper: Concurrent Python Made Simple

2025-01-15
Pyper: Concurrent Python Made Simple

Pyper is a flexible framework for concurrent and parallel data processing, built on functional programming patterns. It seamlessly integrates threaded, multiprocessed, and asynchronous operations, simplifying the creation of ETL systems, data microservices, and data collection pipelines. With a clean API and efficient queueing, Pyper offers a lightweight, pure Python solution (zero dependencies) that prioritizes ease of use, safety, and efficiency. Developers can focus on building logic rather than managing low-level concurrency details.

Read more
Development

Fighting Search Engine Spam: A Powerful Blocklist

2025-01-14
Fighting Search Engine Spam: A Powerful Blocklist

The GitHub project `BadWebsiteBlocklist` offers a powerful filter to block spam and misleading websites from appearing in search results. It aims to clean up search results by removing AI-generated low-quality articles, spam sites, and thinly-veiled advertisements posing as information. The filter is used via the uBlocklist extension; users simply add a subscription link to enable it. The project encourages users to submit spam websites and provides explanations for each blocked site, ensuring transparency and sustainability.

Read more
Development spam filtering

OpenZFS 2.3.0 Released: RAIDZ Expansion, Blazing Fast Deduplication, and More

2025-01-14
OpenZFS 2.3.0 Released: RAIDZ Expansion, Blazing Fast Deduplication, and More

OpenZFS 2.3.0 is here! This release boasts exciting new features including RAIDZ expansion for adding devices to existing RAIDZ pools without downtime, blazing fast deduplication for significantly improved performance, direct I/O for bypassing ARC caching to boost efficiency on devices like NVMe, optional JSON output for most commands, and support for file/directory names up to 1023 characters. Numerous critical bug fixes and performance improvements round out this release. Supported platforms include Linux kernels 4.18-6.12 and FreeBSD releases 13.3, 14.0-14.2.

Read more
Development storage

Campsite Open Source: A Monorepo Deep Dive

2025-01-14
Campsite Open Source: A Monorepo Deep Dive

The Campsite open-source project is a large monorepo containing the entire codebase for their application. While no longer actively maintained, it's a valuable resource for learning how Campsite works and forking for non-commercial projects. The project relies on numerous services, including S3, Pusher, Imgix, 100ms, and OpenAI, requiring extensive configuration for local setup. Detailed instructions are provided for local development, covering environment variable setup, service integration, and running the web app, marketing site, Storybook, and desktop app.

Read more
Development local development

dopy: Python Without Strict Indentation

2025-01-13
dopy: Python Without Strict Indentation

dopy is an experimental Python preprocessor that allows the use of do...end syntax instead of strict indentation. This improves code readability and flexibility, especially for developers familiar with languages like Ruby or Lua. It supports type hints and transpiles .dopy files into PEP 8 compliant .py files. The project is archived and no longer maintained by the original author, but the code remains open-source for learning and reference.

Read more
Development Preprocessor Code Style

JUring: Blazing Fast File I/O in Java with io_uring

2025-01-13
JUring: Blazing Fast File I/O in Java with io_uring

JUring is a high-performance, experimental Java library leveraging Linux's io_uring asynchronous I/O interface via Java's Foreign Function & Memory API for lightning-fast data reads. Benchmarks demonstrate a 33% speed improvement over Java NIO for local files and a remarkable 78% improvement for remote files. Offering both blocking and non-blocking APIs for file reading and writing, JUring is currently under development with planned improvements in instance creation cost, memory usage, and added features like socket support.

Read more
Development High-Performance I/O

Tabby: Your Self-Hosted AI Coding Assistant

2025-01-12
Tabby: Your Self-Hosted AI Coding Assistant

Tabby is an open-source, self-hosted AI coding assistant offering a local alternative to GitHub Copilot. It's self-contained, requiring no DBMS or cloud service, features an OpenAPI interface for easy integration, and supports consumer-grade GPUs. Recent updates include Llamafile deployment integration, an enhanced Answer Engine user experience, the ability to switch between different backend chat models in the Answer Engine, and displaying recently shared threads on the main page. It also boasts IDE plugins and enterprise features like team management and secured access.

Read more
Development AI coding assistant

56-Byte Assembly Snake Game

2025-01-12
56-Byte Assembly Snake Game

A developer on GitHub has open-sourced a remarkably compact Snake game written in assembly language, clocking in at a mere 56 bytes. This minimal game runs not only under DOS, but also without a BIOS, bootloader, or operating system. The developer compressed the code to this incredibly small size and provides both an online demo and a self-hosting version. The project has sparked discussions about code minimization and game development, with comparisons to other similar projects highlighting its significant advantage in code size.

Read more
1 2 32 33 34 36 38 39 40 41 42