slimmmed down all of the material split the regex part from the pandas notebook into its own seperate notebook