After benchmarking the R1 1776 model and seeing how post-training influenced its performance (full post here), I realized another gap.
Models that can technically handle a huge context window often degrade long before that window is full.
Purpose
This benchmark measures the real-world inference performance of Perplexity AI’s R1 1776 model, a post-trained version of DeepSeek R1 671B designed to eliminate censorship and deliver unbiased information, under controlled conditions.
People have been having conversations for thousands of years. We’re wired for it. But we’re not wired for talking to something that doesn’t understand social cues: the subtle, unspoken signals that shape every human exchange.
For years, I’ve been fascinated by AI assistants. They are useful, sure, but they always seem to be missing something. We all want a JARVIS from Iron Man or the computer from Star Trek.