LLM Function Calls Don't Scale: Code Orchestration Is Simpler, More Effective

2025-05-21
LLM Function Calls Don't Scale: Code Orchestration Is Simpler, More Effective

Feeding the full output of tool calls back into LLMs is costly and slow. This article argues that output schemas, enabling structured data retrieval, allow LLMs to orchestrate processing via generated code – a simpler and more effective approach. Traditional methods, where tool outputs are fed back to the LLM as messages for next-step determination, work well with small datasets but fail with real-world scale (e.g., large JSON blobs from Linear and Intercom MCP servers). The article proposes code execution as a fundamental data processing method, using variables as memory, and code to orchestrate multiple function calls for scalable data processing, overcoming the cost, speed, and potential data loss issues of LLMs handling large datasets. This necessitates secure, stateless AI runtime environments, currently in early development.

Read more
Development Code Orchestration

Terminator: The AI Coding Assistant Guardian Angel

2025-05-24
Terminator: The AI Coding Assistant Guardian Angel

Tired of AI coding assistants like Cursor being interrupted by stuck command loops? Terminator, a powerful AppleScript-driven terminal session manager, solves this problem! It achieves process isolation by running commands in separate terminal sessions, keeping your AI assistant responsive even with hanging commands. Terminator creates and manages persistent terminal sessions, isolates command execution, intelligently interrupts busy processes, and provides reliable session state management. With simple commands, you can easily train your AI assistant to use Terminator, boosting efficiency and avoiding frustrating workflow interruptions.

Read more

arXivLabs: Experimenting with Community Collaboration

2025-06-12
arXivLabs: Experimenting with Community Collaboration

arXivLabs is a framework enabling collaborators to develop and share new arXiv features directly on the website. Individuals and organizations involved share arXiv's values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners who adhere to them. Got an idea to enhance the arXiv community? Learn more about arXivLabs.

Read more
Development

Analyzing Lone Wolf Gamebooks with Graph Theory

2025-09-23

The author encoded the Lone Wolf series of gamebooks as directed graph networks and used graph theory algorithms to analyze their properties. The Dawn of the Darklords was excluded from the analysis as it wasn't officially released as a gamebook. The analysis covered 28 books across four series, calculating the shortest path to the ending, the shortest path to death, the path with the most fights, and other statistics for each series. Results showed a decrease in difficulty and an increase in adventure and story focus over time. Technical details like handling disconnected graphs and cycle removal were also discussed.

Read more
Game Lone Wolf

Unpublished Memoir of CP/M Creator Gary Kildall Released

2025-07-18
Unpublished Memoir of CP/M Creator Gary Kildall Released

A portion of an unfinished memoir by Gary Kildall, the creator of the CP/M operating system, has been released by the Computer History Museum. Written before his death in 1994, the excerpt details Kildall's early life and entrepreneurial journey, emphasizing his values of invention and a love of life over profit. Later chapters, detailing his struggles with alcoholism, will remain unpublished.

Read more
Tech

AI-Assisted Development: The Architect's Ascent

2025-05-27
AI-Assisted Development: The Architect's Ascent

From NoCode to AI-assisted development, technological advancements haven't rendered developers obsolete; instead, they've reshaped roles and skill requirements. Past waves of NoCode, cloud computing, and offshore development didn't eliminate developers but created new specializations like NoCode specialists and DevOps engineers, often with higher salaries. AI-assisted development follows this pattern. AI excels at code generation, but it struggles with system architecture. Thus, the most valuable skill in software engineering will be system architecture—a domain where AI lags significantly. AI accelerates development, increasing the complexity of system maintenance and highlighting the crucial role of experienced architects in managing and mitigating risks.

Read more
Development

Pac-Man Superfast: A Speedy Blast from the Past, But Short-Lived

2025-06-21
Pac-Man Superfast: A Speedy Blast from the Past, But Short-Lived

YouTube Playables' Pac-Man Superfast delivers a high-speed Pac-Man experience reminiscent of the Championship Edition series, but with only four ghosts and a mere 13 levels. While the intense gameplay is thrilling, the lack of a 256-level mode, leaderboards, and rewards for extra lives and high scores limit replayability and competitive aspects. Despite this, the challenging gameplay and high-score pursuit make it worth a try, especially for classic arcade game fans. But hurry, before it vanishes like other defunct Google projects.

Read more
Game

Critical Security Flaw in Jitsi's Public Instance: Unauthorized Mic and Camera Access

2025-07-24
Critical Security Flaw in Jitsi's Public Instance: Unauthorized Mic and Camera Access

A critical security vulnerability has been discovered in Jitsi's public instance, an open-source video conferencing application. Attackers can silently initiate a Jitsi meeting in the background by tricking users into visiting a malicious link, gaining unauthorized access to their microphones and cameras. Jitsi claims this is a 'feature' and refuses to fix it. The vulnerability exploits previously granted permissions, allowing attackers to capture audio and video even without the user's knowledge or interaction. The author urges Jitsi to at least remove this 'feature' from the public instance to mitigate the significant security risk.

Read more
Tech

Ballista Botnet Exploits TP-Link Router Flaw, Infecting 6,000+ Devices

2025-03-11
Ballista Botnet Exploits TP-Link Router Flaw, Infecting 6,000+ Devices

A new botnet, Ballista, is exploiting a high-severity vulnerability (CVE-2023-1389) in unpatched TP-Link Archer AX-21 routers, infecting over 6,000 devices. The vulnerability allows remote code execution, enabling Ballista to spread automatically via command injection. The botnet targets manufacturing, medical, services, and technology organizations, predominantly in Brazil, Poland, the UK, Bulgaria, and Turkey, but also impacting the US, Australia, China, and Mexico. Ballista uses a malware dropper and shell script to execute its main binary, establishing a C2 channel to control infected devices and perform DoS attacks and sensitive file reading. Researchers suspect an Italian origin, but the use of Tor networks suggests ongoing development and active evasion techniques.

Read more
Tech botnet

Archaeologists Use Lewis & Clark's Laxatives to Find Lost Campsites

2025-09-01

The Lewis and Clark expedition's 600 giant laxative pills, nicknamed "thunder-clappers," contained mercury, a stable compound. Traces of these pills are helping archaeologists pinpoint the expedition's campsites. High mercury levels in soil indicate old latrine pits, and military manuals help reconstruct the camp layouts. This discovery highlights the limitations of early 19th-century medical practices, where "heroic medicine", while sometimes effective, often did more harm than good.

Read more
Tech

Kahuna: Your IndexedDB Swiss Army Knife

2025-04-07
Kahuna: Your IndexedDB Swiss Army Knife

Kahuna is a browser extension for Firefox and Chromium-based browsers that simplifies IndexedDB database management. It lets you create, modify, view, query, edit, import, and export IndexedDB data. Features include data filtering, pagination, JavaScript code execution, and import/export in various formats (Dexie, JSON, CSV). While documentation is a work in progress, Kahuna is a powerful tool for developers working with IndexedDB.

Read more
Development

Duke Students Throw Away Thousands in Unused Goods

2025-05-27
Duke Students Throw Away Thousands in Unused Goods

A writer living in a Durham apartment building populated largely by Duke University students discovered a treasure trove of discarded items during the end-of-year move-out. High-end goods, including a $900 acrylic table, $395 Balenciaga slides, and over $1000 worth of Lululemon clothing, were found in the building's trash room. The author meticulously documented the items, totaling approximately $6000 in value. A comparison of Duke's donation program with other universities revealed comparable donation rates among wealthy private institutions. The story highlights issues of consumerism, waste, and the effectiveness of university donation initiatives.

Read more

Beyond Capitalism: A Reciprocal Model of Technological Innovation

2025-02-09
Beyond Capitalism: A Reciprocal Model of Technological Innovation

This article critiques the flawed notion of equating technological progress with capitalism. Using the "iPhone fallacy" as an example, the author argues that humanity could possess technology even without capitalism, and that technological innovation should break free from capitalist constraints. The article uses the Mesoamerican milpa agricultural system as an example to illustrate a non-capitalist, reciprocity-based model of technological innovation. This model emphasizes collective wisdom and harmonious coexistence with nature, rather than profit maximization. The author calls for breaking the constraints of the patent system and promoting open-source technological innovation to address global challenges such as climate change.

Read more

Fake Deadlines: A Manager's Secret Weapon?

2025-04-02
Fake Deadlines: A Manager's Secret Weapon?

This article explores the effectiveness of 'fake deadlines' in project management. Drawing on personal experience and the insights of James Stanier, the author argues that setting challenging deadlines leverages Parkinson's Law (work expands to fill the time available), boosting team efficiency and driving project progress. However, the author emphasizes that success hinges on team involvement, clear goals, and open communication, avoiding negative impacts like forced overtime. The ultimate goal is enhanced team productivity, not simply on-time delivery.

Read more
Development deadlines

VMware's Partner Purge: Broadcom Shakes Up the Cloud Again

2025-07-17
VMware's Partner Purge: Broadcom Shakes Up the Cloud Again

Broadcom, VMware's parent company, is once again drastically reshaping its partner program, leaving many smaller players out in the cold. The changes, effective October 31st, 2025, will sunset the white label program and prevent uninvited partners from signing new contracts. This has sparked outrage among partners and customers who face challenges renewing licenses, potential service quality drops, and increased migration costs. This isn't Broadcom's first major partner shakeup; a previous cull caused significant instability. While Broadcom claims these moves optimize its private cloud strategy, its erratic approach has eroded trust.

Read more

ISS Leak Mystery Delays Ax-4 Mission: A New Twist in an Old Problem

2025-06-29
ISS Leak Mystery Delays Ax-4 Mission: A New Twist in an Old Problem

A slow leak from a Russian module on the International Space Station (ISS), ongoing for years, has recently stopped, raising concerns. This could be due to successful repairs, or a new leak may have formed internally, potentially affecting the entire station's air pressure. The private Axiom Space Mission 4 (Ax-4) was delayed as a result, while NASA and Roscosmos investigate. Disagreements persist on the safety risk assessment. Ax-4 includes former NASA astronaut Peggy Whitson and the first astronauts from India, Poland, and Hungary to visit the ISS. Despite the ongoing leak issue, the Crew-11 mission is still scheduled for July.

Read more
Tech leak

AI Code Writing: A Breakthrough with Darwin-Gödel Machines

2025-06-26
AI Code Writing: A Breakthrough with Darwin-Gödel Machines

Microsoft and Google's CEOs have both stated that AI now writes a significant portion of their company's code. New research introduces a system called Darwin-Gödel Machines (DGMs), which uses a combination of large language models and evolutionary algorithms to achieve recursive self-improvement in code-writing agents. DGMs significantly improved performance on coding benchmarks through iterative refinement, even surpassing systems using fixed external improvement methods. While current DGM performance doesn't exceed human experts, it showcases immense potential and sparks discussion about AI safety and risks.

Read more
AI

Perplexity CEO: Big Tech Will Copy Your Good Ideas – Get Used To It

2025-07-16
Perplexity CEO: Big Tech Will Copy Your Good Ideas – Get Used To It

Perplexity CEO Aravind Srinivas advises young entrepreneurs to expect their good ideas to be copied by larger companies. He uses Perplexity's web-crawling feature as an example, highlighting how large tech firms, with their massive capital, constantly seek new revenue streams and will copy anything valuable. He encourages startups to work hard, anticipate this competitive landscape, and be wary of potential suppression tactics, such as monopolistic behavior used to stifle competition.

Read more
Startup

Doomsday Clock Ticking: Hegel and the Future of Humanity

2025-03-31
Doomsday Clock Ticking: Hegel and the Future of Humanity

This article reviews Richard Bourke's new book, *Hegel's World Revolutions*. Against the backdrop of the ever-approaching Doomsday Clock, the author explores Hegel's philosophy of history, arguing that it offers insights into the current global crisis. Hegel believed history is not meaningless but progresses towards the advancement of consciousness of freedom. Bourke emphasizes the complexity of history, including both progress and setbacks, and criticizes the contemporary academia's simplistic rejection of Enlightenment values. The article ultimately questions Bourke's view that historical research should avoid applying past ideas to the present, suggesting that in the face of existential threats, we should draw on historical wisdom to seek solutions.

Read more
(drb.ie)

GitHub Action: Auto-Posting RSS Feeds to Bluesky

2025-01-30
GitHub Action: Auto-Posting RSS Feeds to Bluesky

Blueskyfeedbot is a GitHub Action that automates posting RSS/ATOM feeds to Bluesky. Users create a GitHub repository, configure secrets (Bluesky username and app password), and set up a workflow file specifying the RSS feed URL and a Handlebars template. This simplifies content syndication to Bluesky, offering a convenient solution for bloggers and developers.

Read more
Development

A Forty-Year Quest for a Childhood Story

2025-01-09
A Forty-Year Quest for a Childhood Story

The author recounts a forty-year journey to rediscover a cherished childhood story from a purple book. His quest, utilizing online resources and libraries, was repeatedly thwarted by inaccurate information generated by AI tools. Ultimately, an experienced librarian's expertise led to the discovery of the book containing the story, "From Michaelmas to Candlemas." Contacting the author's relatives yielded the original manuscript. This tale highlights the perseverance of the search and underscores the irreplaceable value of human expertise in the age of AI.

Read more

Agentic Coding Assistants: Impressive Progress, Persistent Challenges

2025-03-26
Agentic Coding Assistants: Impressive Progress, Persistent Challenges

Generative AI, particularly LLMs, is revolutionizing software development. This memo details the author's experience using AI coding assistants over several months, revealing significant efficiency gains alongside persistent challenges. The AI frequently misdiagnoses problems, uses brute-force fixes, lacks code reusability, and generates redundant code, impacting team workflow and long-term maintainability. The author categorizes these issues into three impact radiuses: time to commit, team flow, and long-term maintainability, and offers mitigation strategies such as careful code review, regular reflection, and establishing code quality monitoring mechanisms. The core message is that despite rapid AI advancements, developer experience and skills remain crucial.

Read more
Development

Windows 11 File Explorer Gets AI Shortcuts

2025-05-20
Windows 11 File Explorer Gets AI Shortcuts

Microsoft is integrating AI shortcuts, called AI actions, into Windows 11's File Explorer. These allow right-clicking a file to access Windows AI features like blurring photo backgrounds, erasing objects, or summarizing Office files. Four image actions are currently being tested, including Bing visual search, background blur and object removal (from the Photos app), and background removal in Paint. Similar AI actions for Office files are planned, enabling summarization of OneDrive/SharePoint documents and AI-generated lists. This will initially be for Microsoft 365 commercial subscribers with Copilot; consumer support is coming later. Alongside this, Windows 11 widgets are getting a visual refresh with Copilot-curated stories. A new "User Interaction-Aware CPU Power Management" feature aims to reduce power consumption on inactive laptops and tablets.

Read more
Tech

Underwater Octopus Cities Discovered in Australia

2025-08-19
Underwater Octopus Cities Discovered in Australia

Off the coast of Jervis Bay, Australia, two remarkable octopus settlements, dubbed 'Octopolis' and 'Octlantis,' have been discovered. These bustling communities of gloomy octopuses (Octopus tetricus) utilize shells to construct their dens, creating unique and densely populated habitats. Octopolis, the first discovered, even contains a piece of human-made debris. While often sensationalized as 'cities' in the media, researchers emphasize this is a metaphorical description, highlighting the complex social behaviors and surprising engineering skills of these cephalopods.

Read more
Tech

Open-Source Toolkit: Assessing and Mitigating Hallucination Risk in LLMs

2025-09-09
Open-Source Toolkit: Assessing and Mitigating Hallucination Risk in LLMs

Hassana Labs has released an open-source toolkit for assessing and mitigating hallucination risk in large language models (LLMs). Without requiring model retraining, the toolkit leverages the OpenAI Chat Completions API. It creates an ensemble of content-weakened prompts (rolling priors) to calculate an upper bound on hallucination risk using the Expectation-level Decompression Law (EDFL). A decision to answer or refuse is made based on a target service-level agreement (SLA). Supporting both evidence-based and closed-book deployment modes, the toolkit provides comprehensive metrics and an audit trail for building more reliable LLM applications.

Read more

Musk's DOGE Team: A 19-Year-Old Hacker and a Massive Government Data Breach

2025-02-09

Wired revealed that a 19-year-old working for Elon Musk's so-called "Department of Government Efficiency" (DOGE) gained access to sensitive US government systems despite his past association with cybercrime communities. This teen, a former member of 'The Com,' a distributed cybercriminal network, has raised serious concerns. Since Trump's second inauguration, DOGE has accessed vast amounts of sensitive data, controlling databases at the Treasury, OPM, and other departments. The 19-year-old, Edward Coristine, known online as "Big Balls," founded Tesla.Sexy LLC and runs the ISP Packetware, with links to cybercrime. His past actions are incompatible with government security clearance standards, leading to significant security risks and widespread lawsuits.

Read more

Mining Atari Games from Random Data: A Computationally Intensive Treasure Hunt

2025-06-10
Mining Atari Games from Random Data: A Computationally Intensive Treasure Hunt

This project attempted to 'mine' Atari 2600 games from 30 billion 4KB files of random data. Using clever heuristics and massive GPU parallelization, the author drastically reduced the search space. The project unearthed ROMs that ran and produced interesting visual output in an emulator, even discovering a 'proto-game' responding to player input. This proves that even in completely random data, information with specific characteristics can be found with the right approach. The experiment offers new avenues for exploring vast possibilities using computational resources.

Read more
Game

Power Efficiency Showdown: Simulating Voltage Boosting Circuits

2025-04-28
Power Efficiency Showdown: Simulating Voltage Boosting Circuits

This post compares the power efficiency of several voltage boosting circuits using the Lush Projects circuit simulator. Circuits tested include a buck converter, parallel buck converter, serial buck converter, pulsed transformer, and Joule thief. All circuits boosted a 5V DC input to a stable 10V output, measured across a 1kΩ resistor load. The parallel buck converter proved most efficient (92.73%), followed by the serial (91.32%) and standard buck converter (88.43%). The pulsed transformer was least efficient (73.85%), while the Joule thief lagged far behind at only 22%. The author discusses component choices (capacitors, resistors, MOSFETs) and their impact on efficiency.

Read more

Boost UI Design Efficiency: Prioritize Global Consistency Over Local Optimization

2025-09-19
Boost UI Design Efficiency: Prioritize Global Consistency Over Local Optimization

While redesigning Lighthouse, the author developed a system for creating better UI designs with less effort. The core principle is prioritizing global UI consistency over local perfection. This involves selecting and fully utilizing a component library (like HeroUI), avoiding custom components; using only two font weights and two text colors; maintaining visual consistency between icons and text; and creating and adhering to a project-specific design rule document. These strategies significantly improved design efficiency and resulted in a smoother, more usable interface.

Read more
1 2 134 135 136 138 140 141 142 596 597