create script in python to format said data to C begin transition to using C++ functions
factor out reused functions to shared.cpp/shared.h