May a Thousand BLOOMs Flower
M.S. HDS Capstone Complete
For the last couple of months I’ve been too busy to write about the things I’ve been learning and doing, but if you’re curious (or have a previously incurable case of insomnia) click this link to access my just completed Capstone project for my Master’s in Health Data Science Program.
If you’re more of a slides / presentation style person, you can click this link and get an overview in slide form.
Tl;dr — I spent a semester learning about NLP and LLMs (some of that learning I’d already blogged about) and then tried to build out a process to customize a large language model for question answering — notably, a model that when I started the program did not have the native capability to be configured as such. I produced the first published BLOOM LLM for question answering tuned for the SQuAD dataset, which can be accessed here:
There’s a lot of content and context herein, and I may go back and blog about some of the many, many things I have learned since my last post, but for now I’m just going to celebrate the completion of this milestone and step away from the computer for a bit. :) In the meantime, I look forward to any comments from anyone regarding the project, what I got wrong, what I got right, and ways I could have done it better.