May a Thousand BLOOMs Flower

Jason Eden
2 min readDec 2, 2022

--

M.S. HDS Capstone Complete

For the last couple of months I’ve been too busy to write about the things I’ve been learning and doing, but if you’re curious (or have a previously incurable case of insomnia) click this link to access my just completed Capstone project for my Master’s in Health Data Science Program.

Creative title. Deceptive too! But perhaps I’ve already said too much…

If you’re more of a slides / presentation style person, you can click this link and get an overview in slide form.

More colorful. Just as deceptive.

Tl;dr — I spent a semester learning about NLP and LLMs (some of that learning I’d already blogged about) and then tried to build out a process to customize a large language model for question answering — notably, a model that when I started the program did not have the native capability to be configured as such. I produced the first published BLOOM LLM for question answering tuned for the SQuAD dataset, which can be accessed here:

There’s a lot of content and context herein, and I may go back and blog about some of the many, many things I have learned since my last post, but for now I’m just going to celebrate the completion of this milestone and step away from the computer for a bit. :) In the meantime, I look forward to any comments from anyone regarding the project, what I got wrong, what I got right, and ways I could have done it better.

--

--

Jason Eden

Data Science & Cloud nerd with a passion for making complex topics easier to understand. All writings and associated errors are my own doing, not work-related.