My Genie Got It Wrong: Evaluating LLMs for a RAG Chatbot How do you choose the right LLM for a RAG chatbot? I compared Llama 70B, 405B, and GPT-4 across 100 iterations. The AI agent's recommendation was wrong.
Have Your Slice and Eat It: Boost Quality with Vertical Story Slicing How product features are sliced into smaller work items has a big impact on quality. Slice your epics well, and quality will follow.
Why Software Teams Need Speed Limits Reducing the speed limit before a known bottleneck allows traffic to flow smoothly. While we feel we're going slower, we actually get to our destination faster. The same principles apply in software engineering.
Using AI in Quality Coaching Many teams are keen to explore how AI can help them improve test coverage or reduce the time required for testing.
Experiments in Quality Coaching Use the concept of experiments to encourage product teams to try new approaches, tools, and ways of working.