That was fast! I feel like I just wrote January's roundup. I spent a fair amount of time on the road throughout the month. There were also a few really exciting projects at work that kept me busy. I was still able to push a few updates to the NBA Now feed, largely focused on reliability and quality to continue building the best NBA community experience on Bluesky!
I'm really happy to have started this NBA Now community last month. Supporting and improving the feeds have given me a lot of great experience working with the atproto and thinking through problems associated with supporting a production system.
What I've been Building
NBA Now Continues to Grow! 📈
I've started treating my NBA Now Bluesky feed as a real product. The primary update for this month was adding a user propensity score to the stream processing to filter out posts that are unlikely to be NBA related. A common scenario this solves for is when a post contains a keyword like "Spurs". Does this refer to the NBA team of the EPL team of the same name? Sometimes the post itself doesn't include enough context to make that determination. The User Propensity Score helps filter out the non-NBA posts by looking at the user's posting history. If the user has never posted about the NBA before, it's likely the post is about the EPL team and can be omitted from the feed. The process isn't perfect, but it has been an effective first step towards keeping the feed focused on NBA content.
Vibes out. Data in.
Implementing the user propensity score was largely driven by vibes. On EPL game days I was seeing posts clearly about Tottenham in the feed. After implementing the score the number of Tottenham posts dropped off significantly. Vibes only go so far though. Treating the feed as a product means shifting decision making to be more data-driven instead of following the vibes. This past month I've been working on setting up metrics to monitor the feed.
One metric I've been tracking more formally is the daily active users (DAU). Tracking the DAU gives me a sense of how much value users are getting from the feeds. Anyone who visits one of the NBA Now feeds on a given day is counted as a DAU. Through the month the feed has averaged around 60 DAU. Since the feed launched about 300 users have visited the feed at least once, which means the conversion rate to being an active user is around 20%, a number I'm happy with. The consistent DAU number is a good sign that the feed is providing value to users making them want to come back every day.
Of the four NBA feeds I've built, NBA Now, Highlights, Reads, and Trending, NBA Now has by far the largest audience. One other fix this month was dropping periodic posts into the feed to promote the other feeds (Highlights, Reads, and Trending). Through this action the activity on the NBA Reads feed has been growing, finding its own steady audience.
Measuring Feed Quality with a LLM Judge
A second metric I'm prioritizing is feed quality. Feed quality is measured by the percentage of posts in the feed that are related to the NBA. The objective is to minimize the amount of non-NBA content in the feed. Measuring this is a bit tricky. Manually reviewing the thousands of posts in the feed is not feasible. To reduce the amount of manual work I implemented a LLM Judge to evaluate the classification of each post.
Each day I take a sample of posts from the feed and instruct another LLM to evaluate the post and classification to judge whether the post is related to the NBA. While the judge is not 100% accurate, it has been a helpful tool for monitoring feed quality without much effort.
I'll be sharing more about the LLM Judge implementation once it's finalized. One improvement I'm working through is to use the OpenAI Batch API for the judge calls. Since I don't need the responses immediately the batched calls save API costs, which is worthwhile since the latency isn't critical.
In the meantime, early experiments have put the feed quality at 96%! This is a good sign the feed is staying on track and not delivering much non-NBA content. One other benefit of the LLM Judge is that by performing an error analysis I can see where posts are incorrectly included or excluded in the feed to identify opportunities for improvement!
Next Steps for NBA Now
Largely I'm going to continue to focus on reliability and quality. Without those the feed becomes useless. This includes making sure the LLM Judge pipeline is set up for monitoring and establishing a workflow for identifying areas of improvement. Part of the quality process is being able to quantify impacts of changes. Having monitoring in place allows me to expand the scope of what the feed covers to capture more NBA content without sacrificing quality.
A second objective is to grow the audience. There's a consistent audience using the feed today, which is good signal people find value in the feeds. How can I grow that audience? This task is a bit more challenging for me. Self-promotion and marketing are not my strong suit. I don't want to spam promos for the feeds every day, I'd like to find a way to grow the audience organically. Still figuring this one out.
Finally, there is one more feed I'm working on for the NBA Now collection. I've started working on a blueprint for the new feed, but will hopefully be able to focus more on the development this month.
What I've been Reading
AI Engineering by Chip Huyen continues to be excellent. This has been a slot read for good reason. The book is so full of information I'm taking my time to digest each chapter. Right now I'm working through the chapters on evaluating LLM systems. I cannot recommend this book enough!
Foundation's Edge by Isaac Asimov. I finally finished this one and started the next book in the series. After a long lapse in this series I'm back to enjoying it again!