written by Eric J. Ma on 2021-03-01 | tags: machine learning system design data science
I think we're past the age of building fancy models. After about a decade or so of seeing where models could make an impact on decisions being made, I think we're entering a phase where operationalizing these models is more important than building bigger and more powerful models. The field of data science is evolving yet again, this time with a big offshoot branch with the name "Machine Learning Engineering".
The key problem here is that most data scientists don't possess the software and engineering skills to build and operationalize machine learning systems for colleagues. There's a big opportunity here to make machine learning systems work. The key here is to design machine learning systems the way software engineers design software systems: modular, with clean contracts, minimizing entanglement, etc. etc.
There are very few resources that I know of that can help us learn what needs to be done, but here are a few that I have encountered.
@article{
ericmjl-2021-machine-design,
author = {Eric J. Ma},
title = {Machine learning system design},
year = {2021},
month = {03},
day = {01},
howpublished = {\url{https://ericmjl.github.io}},
journal = {Eric J. Ma's Blog},
url = {https://ericmjl.github.io/blog/2021/3/1/machine-learning-system-design},
}
I send out a newsletter with tips and tools for data scientists. Come check it out at Substack.
If you would like to sponsor the coffee that goes into making my posts, please consider GitHub Sponsors!
Finally, I do free 30-minute GenAI strategy calls for teams that are looking to leverage GenAI for maximum impact. Consider booking a call on Calendly if you're interested!