This is a story of how you can find a bug before users report it, but don’t end up fixing it due to other priorities or communication breakdown.
I was trying to investigate a bug with our software, and ventured into a related module to try and configure it. However, it crashed. After looking at the code where the crash occurred, I found the developer that had likely caused it, and it was a change he made in the previous month. I sent a message directly to Paul to alert him to it. There was a chance the change hadn’t been released yet, and he could fix it in time. Or if it had been released, then it probably needed to be fixed urgently because the feature looked unusable as far as I could tell.
Paul replied stating that he had realised his change had caused a bug elsewhere and this was essentially the same mistake, so he would add it to his list of things to fix. However, he was going on annual leave in a couple of days so would need to hand it over to his team.
He sent an email to his team and copied me in.
Recently my change had caused a bug in Resource Manager. If you archive and unarchive a template it causes a crash. It became evident that there is the potential for two more crashes to occur that are related.
Schedules (clicking on any folder, demonstrated by Praveen)
Assessment (found today, but we believe this only is used by 1 site)
Root cause is that the control that I changed is the same in these other areas, however these “resources” do not have a file size.
I have created a bug for it, and raised a draft change which I believe would fix the issues (I have done some local testing but more required).
This could be picked up by someone else in my absence if it needs to be progressed asap (code is live and not behind a feature flag).
These discoveries have been through our own efforts, not through Support.
Needless to say, this feature has had its fair share of issues, and I will write up a report/lessons learned document upon my return.
A few days later, Paul returns back from his annual leave.
[15:13] Paul Marshall
you know that bug in assessments module you highlighted the other day?
Major Incident raised. Only 2 complaints so far, but we have found more occurrences when browsing the error logs.
[15:35] Paul Marshall
I raised the PR (draft) but did not progress it as I was literally going on holiday that day. I emailed a bunch of folks and gave the full picture to them, including all the evidence and PR but it wasn’t moved on.
When I was back I raised it and it sat there not reviewed by the team even though I asked repeatedly to have it reviewed.
Sorry, bit of a rant there....
its moving now
[15:40] Me
let's write a blog post for Viva Engage about how we came close to preventing a Major Incident
others foiled our attempt
Emails were flying, managers were sighing, and somewhere, a developer was crying.
AI
The Curious Case of Paul’s Holiday: A Software Saga
Once upon a time, in the not-so-far-away land of Software Developmentia, there was a developer named Paul. Paul was a diligent worker, known for his sharp eye for detail and his love for datagrids. One fateful day, Paul made a change to a datagrid that was nothing short of genius… for a certain group of people. For another group, however, it was a disaster waiting to happen.
Paul, with his developer senses tingling, caught the potential calamity before it unleashed chaos upon the unsuspecting second group. He crafted a fix as elegant as a ballroom dance and sent out an email to his manager and team, detailing the issue, the solution, and the testing needed. It was a masterpiece of communication, a symphony of technical jargon and urgent pleas. And with that, Paul packed his bags and set off on a well-deserved holiday, visions of bug-free code dancing in his head.
But alas, in the twist of all twists, Paul’s team, perhaps too mesmerized by his email’s eloquence or too entranced by their own holiday plans, did nothing. The fix sat, lonely and unimplemented, as Paul’s toes wiggled in the sand and his out-of-office reply cheerfully told people he’d be back soon.
Upon his return, Paul found his inbox flooded with the digital cries of users caught in the datagrid debacle. The issue had gone live, and chaos reigned. Emails were flying, managers were sighing, and somewhere, a developer was crying.
What can we learn from this tale of holiday woe? Perhaps it’s that communication, no matter how clear, is only as good as the action it inspires. Or maybe it’s that one should never go on holiday without ensuring their work is done… or at least assigned to someone who isn’t equally holiday-bound.
So let us raise our mugs of coffee to Paul, the developer whose holiday will forever be remembered in the annals of Software Developmentia. May his next vacation be free of datagrid disasters and full of completed tasks.
And to all the Pauls out there, a word of advice: double-check your team’s to-do list before you hit that out-of-office button. It might just save your holiday… and your datagrids.