Data & AI Leader

Alexander Liss

I lead teams across data science, data engineering and analytics. With twelve years building digital experiences and products across Fortune 100 scale, I've learned that the hardest part is never the technology. It's the organizational alignment. I also do original research in AI systems design, because as AI-native organizations become learning systems, creating feedback loops through data and analytics is more important than ever.

Alexander Liss
§ 01

Selected papers.

Published work and preprints. Links to PDFs and code where available.

01

Attention Fine-Tuning (AFT)

Alexander Liss

The next frontier in AI is models that improve themselves. Every major lab is racing toward systems that evaluate their own outputs, identify weaknesses, and update accordingly, like Google's AlphaEvolve. The problem is the reward signal from RLHF. Human preference labels are expensive, brittle, and external reward shaping too often results in model collapse.

But what if the reward signal was in the model the whole time?

This paper introduces Attention Fine-Tuning (AFT): a post-training framework that derives its reward signal entirely from within the model, with no human labels required. In internal testing as we developed this framework, we achieve significant results including:

  • A 9.2% improvement in conversation quality over an SFT baseline on 7,372 test examples
  • 95.5% of test examples improved on attentional focus, with 100% improvement on mid- and late-dialogue turns
  • A mapping of attractor states that draw conversational systems into semantic collapse

This framework is applicable to environments where optimizing the ability of an LLM to hold a conversation is critical, including customer service, education, personal assistants, and video games. It allows post-training without manually labelled preference data.

02

Experience Orchestrator (EO)

Alexander Liss, Nicholas Desmond, Santiago Gil Gallego

In 2026, multi-agent ecosystems are everywhere. OpenClaw, MoltBook, and their successors let agents act autonomously in open-ended environments. And the world is discovering that prompting them toward good behavior doesn't scale. What's missing is a general framework for keeping LLM-based agents aligned with optimal behavior.

This paper presents the Experience Orchestrator, a formal framework for applying control theory to govern large language models as independent agents. In internal testing as we developed this framework, we achieved significant results including:

  • Measurable lift of +32 pts in goal completion rate over a naive LLM approach.
  • LLMs that move beyond 'annoyingly friendly' defaults toward nuanced, humanlike conversational intent.
  • Rich dialogue histories that provide training data for further iteration.

The framework generalizes to any environment where agents must make decisions under partial observability. It provides a way to govern LLM agents through a shared policy, similar to classical multi-agent reinforcement learning, but applicable to the world of OpenClaw.

§ 02

Notes & essays.

Longer-form thinking on my Substack.

01

For Humans, love is the drug. But for AI, it's the reward signal.

Substack post announcing my reward signal thesis

02

From clicks to cognition: digital experiences are becoming learning systems.

Your website isn't a destination anymore. It's training data.

§ 03

A bit more about me.

Currently

Engineer and researcher working at the intersection of AI systems design and applied machine learning. My current focus is the reward signal problem: how do you give an AI system a persistent, measurable sense of whether it's getting closer to what actually matters? That question is driving two active research programs. One looks for the signal inside the model; the other governs the system from above.

Education

  • MS, Computer Science, Georgia Institute of Technology, AI Specialization (in progress)
  • MBA, NYU Stern School of Business, Business Analytics Specialization
  • BA, George Washington University, Japanese Language and Literature

Interests

  • Reward signal design and post-training methods
  • Multi-agent systems and control theory
  • Reinforcement learning
  • Cognition as a dynamic system

On the side

Listener of Huberman Lab, runner, skier, anime fan, and father of human children and dogs. I also organize the Denver Data Dudes & Dudettes meetup. Come find us if you're in Denver and thinking about data.

§ 04

Contact.

Happy to chat about research, writing, or collaboration.