Linus Seah

Articles

These days I spend most of my time thinking about the intersection of business strategy, product thinking, and AI. I'm particularly interested in how organizations can adopt AI tools practically. Beyond the technical side, I'm curious about what AI means for the world at large: how it will transform work, how future generations will learn, whether it narrows or widens inequality, and what it ultimately says about the value of human judgment and our responsibility in guiding its development.

I learn by building. My current focus is on understanding agentic AI systems: what they can actually do, where they break down, and how to evaluate the trade-offs between complexity and value. I write about what I learn so that others navigating the same questions can benefit from the experiments I've already run.

  • What Enterprises Actually Need to Go AI-Native
    March 2026 · 15 min read · Applied AI Thinking for Operators · Part 1
    The technology is the easy part. Here is what actually determines whether enterprise AI adaptation succeeds or fails. From organizational readiness to workflow redesign, unpacking the four fronts that matter most.
    Enterprise AI Organizational Design Agentic AI Digital Transformation
  • Navigating an AI-Pilled Labor Market: Thoughts on Workforce Adaptation
    March 2026 · 14 min read
    Two questions keep coming up in every conversation I have about AI and work: what should I study, and what skills should I be building? Here is what current research actually says, and what to do about it. Includes a full research paper on AI's labor market impact.
    AI Labor Markets Career Economics Future of Work
  • Open World Models, Physical AI, and the Road to AGI
    March 2026 · 18 min read · Applied AI Thinking for Operators
    Reflections from working with the research team at Genmo: what open world models actually are, how they differ from physical AI and VLAs, why the LLM playbook doesn't transfer, and what needs to be true to push this frontier forward.
    World Models Physical AI Robotics AGI AI Research
  • When Your User Isn't You
    March 2026 · 11 min read · Applied AI Thinking for Operators · Part 3 of 4
    Scaling from a single personal agent to a multi-agent production pipeline, and what that progression forced me to rethink. From ICP discovery with Exa's findSimilar to 3-tier engagement modeling to qualification as explainability, not filtering.
    AI Agents Lead Generation Solutions Architecture Agent Orchestration
  • From Demo to Production
    March 2026 · 13 min read · Applied AI Thinking for Operators · Part 4 of 4
    Three iterations, two complete rebuilds, and what I learned about picking the right stack for the right stage of a project. Why Streamlit → Next.js, JSON → Supabase, and what filter-vs-qualifier taught me about product philosophy.
    Next.js Supabase Vercel Solutions Architecture
  • How Do We Know If an AI Model Is Actually Good?
    February 2026 · 12 min read · Evaluation Series Part 1 of 2
    The gap between benchmark scores and real-world performance is wider than the leaderboards suggest. Understanding the three layers of model evaluation (general capability, safety, and task-specific performance), why benchmarks degrade over time, and the case for building your own evaluation framework.
    Model Evaluation LLM Benchmarks Applied AI AI Agents
  • Building an LLM Judge to Evaluate My AI Digest Agent
    February 2026 · 13 min read · Evaluation Series Part 2 of 2
    Using a language model to score another language model's output sounds circular. An eight-dimension custom rubric, Pearson calibration to align the judge with my taste, what the scores revealed about source diversity issues, and why evaluation costs more than generation.
    LLM Evaluation LLM-as-a-Judge Calibration Claude Opus
  • What "Agent" Actually Means: Lessons from Building My Morning News Digest
    February 2026 · 10 min read · Part 1 of 2
    I built the same thing three times — each with a different level of "agency." Here's what I learned about what that word actually means, why automatic doesn't equal autonomous, and how to think about the spectrum of agency in AI systems.
    AI Agents LLM Architecture Building in Public
  • The Technical Playbook: Building a Personal AI Digest from Scratch
    February 2026 · 10 min read · Part 2 of 2
    The implementation details: Claude Agent SDK, Exa Search, free vs. paid models, every bug I hit, cost analysis, and the fallback pattern that makes agentic systems production-ready. Full code walkthroughs included.
    AI Engineering Claude Agent SDK Exa Search Python
  • Inside Out: ML & Lexical Rule-Based Emotion Classifier for Text
    January 2020 · Capstone Project
    A Flask web application that analyses textual input to classify emotions at the sentence level, combining a Logistic Regression model (trained on ~500k tagged observations) with VADER valence scoring and TextBlob phrase extraction. Deployed on Heroku.
    NLP Classification Flask Heroku Python
  • Web Scraping and Classifying Posts from Reddit
    December 2019 · GA DSI Project 3
    Web scraping posts from two subreddits via the Pushshift API and applying NLP and classification modelling (Logistic Regression, Naive Bayes, Random Forest) to accurately distinguish between communities.
    NLP Web Scraping Classification Python
  • Regression and Classification with Housing Data
    January 2019 · GA DSI Project 2
    Using the Ames housing dataset to estimate sale prices and identify features that predict abnormal sales (foreclosures). Covers the full ML project framework: EDA, feature engineering, Lasso/Ridge regression, and model evaluation.
    Regression Classification EDA Python
Linus Seah · 2026