Data & Daydreams (Blog)

Some thoughts I'm trying to think through more completely. Writing them down forces me to articulate my reasoning from start to finish. This is mostly just for curiosity, fun, and scratching my ever-present itch to optimize things that don't necessarily need optimizing.

2026

Breaking Deep Research: Where Retail User LLM Search Agents Fail and Why Verification Still Falls on You

15 minute read

Published:

Deep research tools from OpenAI, Google, and Perplexity promise source-grounded synthesis, but their reliability depends heavily on what they are searching. Professional tools connected to peer-reviewed databases like PubMed have a built-in quality gate: the barrier to publication is literally peer review. But for code repositories, blog posts, and other sources with no editorial barrier, deep research inherits every error, exaggeration, and fabrication in the source material. Digging into some of the recent literature myself, it looks like studies often find citation accuracy as low as 40% and fabricated references. I tested all three tools on the same library-evaluation question as a small experiment to show how they can fail (a la proof by contradiction). One tool recommended an abandoned library as its top pick with an apparently fabricated release date. The issue of AI-generated content infiltrating peer review itself is a separate and important problem, but not within the scope of this post.

2025

Pure Indexing, Flirting with Factors, and Finding Intermediaries: Thoughts on Addressing Three-Fund Portfolio Inefficiencies

44 minute read

Published:

After reading Ang’s ‘Asset Management: A Systematic Approach to Factor Investing’ and rereading the newest 50th anniversary edition of Malkiel’s ‘A Random Walk Down Wall Street,’ I wanted to explore how funds like Dimensional’s DFUS address indexing inefficiencies without fully committing to factor tilts. The goal: capture market beta with index inefficiencies such as adverse selection addressed. 5-factor regressions indicate DFUS is a VTI equivalent for US equities, but a VXUS equivalent remains unsolved. This is not financial advice and I am just your average retail investor.