written by Eric J. Ma on 2025-05-25 | tags: cursor routing ui cognitive ai automation apple workflow design improvement
In this blog post, I explore Cursor's latest update featuring automatic model routing, which eliminates the need for manual model selection. This change reduces the cognitive tax of micro-decisions, allowing me to focus more on coding. Drawing parallels to Apple's design philosophy, I discuss how removing unnecessary interfaces can enhance user experience. This shift in AI tool design suggests a future where systems handle complexity, simplifying user interactions. What other invisible frictions in AI tools could be automated away to improve our workflow?
Read on... (777 words, approximately 4 minutes reading time)written by Eric J. Ma on 2025-05-24 | tags: vscode workspaces coding agents llamabot tutorial context ai organization efficient
In this blog post, I share how using Workspaces transformed my workflow while building out my tutorial repositories for SciPy 2025. By adding multiple repositories to a single workspace, I eliminated constant window switching and enabled my coding agent to access context across all repos simultaneously. This setup allows for seamless coordination between library code and tutorial examples. I provide a step-by-step guide to setting up a multi-repo workspace and offer tips for maximizing its benefits. Curious about how this can streamline your coding process and enhance your productivity?
Read on... (717 words, approximately 4 minutes reading time)written by Eric J. Ma on 2025-05-08 | tags: scipy tacoma seattle community tutorials llms networks sprints python open source
In this blog post, I share my excitement for SciPy 2025 in Tacoma, Seattle. The Pacific Northwest's beauty, the vibrant community of tool builders, and the chance to give back through financial aid make it special. I'm thrilled to teach tutorials on LLMs and network analysis, and I can't wait for the code sprints, where collaboration and learning thrive. If you're considering attending, the opportunity to meet tool makers and explore new libraries is unmatched. Will you join us in this unique experience?
Read on... (535 words, approximately 3 minutes reading time)written by Eric J. Ma on 2025-04-26 | tags: modal serverless pypi ollama gpu vscode storage cloud deployment cicd
In this blog post, I share my experiences with Modal, a platform that simplifies cloud infrastructure. I discuss deploying web apps and hosting Ollama with low latency using Modal's cost-efficient, serverless solutions. I also explore creating a 'serverless workstation' with persistent storage and flexible resource allocation. Modal's intuitive volume management, unified abstractions, and easy debugging make it a standout choice for developers. Curious about how Modal can transform your cloud projects?
Read on... (1299 words, approximately 7 minutes reading time)written by Eric J. Ma on 2025-04-19 | tags: standardization ai coding protein mass tools project collaboration development
In this blog post, I share my experience from a live coding demo at BioIT World 2025, where I built a protein mass spectrometry calculator tool. I emphasize the importance of standardization in data science, starting with a design document, and using AI assistance for rapid development. Despite a live demo hiccup, I showcased the tool's capabilities and highlighted key lessons in AI collaboration, such as the value of context and interactive communication. How can AI tools enhance your development process while maintaining human oversight and creativity?
Read on... (1088 words, approximately 6 minutes reading time)written by Eric J. Ma on 2025-04-08 | tags: marimo reactive notebooks uv deployment serverless data science modal
In this blog post, I share my experience with Marimo notebooks, highlighting their fully reactive nature and self-contained environments. I discuss how to run Marimo without installation using uv
, and the benefits of AI-assisted coding. I also cover exporting notebooks to Markdown and deploying them as Modal apps. While Marimo's keybindings differ from Jupyter, its reactive execution and UI builder offer unique advantages. Curious about how Marimo can transform your coding workflow?
written by Eric J. Ma on 2025-04-05 | tags: automation bayesian estimation robust estimation r2d2 experiment design
A high-throughput screening lab transforms their data analysis workflow by applying statistical thinking from the start. By combining robust estimation with R2D2 priors, they eliminate tedious manual data cleaning, automatically handle outliers, decompose sources of variation, and objectively measure both statistical estimation model and laboratory quality performance. This story demonstrates how thoughtful experimental design paired with principled statistical methods can dramatically improve both efficiency and scientific quality. How might statistical thinking transform your experimental workflow?
Read on... (1806 words, approximately 10 minutes reading time)written by Eric J. Ma on 2025-04-03 | tags: bayesian r2d2 variance modelling fluorescence experimental design probability of superiority probabilistic modelling data science statistics
In this blog post, I explore how to tackle experimental noise and candidate ranking in protein screening using Bayesian methods. By employing R2D2 priors, we can decompose variance into interpretable components, helping us understand the true biological signal versus experimental artifacts. Additionally, Bayesian superiority calculation allows us to quantify the probability that one protein outperforms another, providing a more robust comparison than traditional methods. These techniques are not only applicable to protein screening but also to drug discovery, materials science, and more. Are you ready to enhance your experimental insights with Bayesian logic?
Read on... (3956 words, approximately 20 minutes reading time)written by Eric J. Ma on 2025-04-02 | tags: data science leadership workflows best practices software change management innovation ai open source
In this blog post, I share insights from my talk at BioIT World about leading one of Moderna's data science teams. I discuss our mission to make science run at the speed of thought and how we standardize workflows across the data scientists to enhance creativity. Key points include designing delivery models, making best practices easy, and balancing standards with innovation. I also touch on AI-assisted coding and our open-source infrastructure. Our approach aims to liberate scientists for their best work. Curious about how we achieve this balance and what it means for the future of data science at Moderna?
Read on... (1943 words, approximately 10 minutes reading time)written by Eric J. Ma on 2025-03-17 | tags: open source python sprints matplotlib git networking career community skills development
In this blog post, I share my transformative experience participating in the SciPy sprints, where I made my first open source contribution to Matplotlib. Through this journey, I gained confidence in git, improved my software skills, and learned the importance of effective communication with maintainers. Sprints offer invaluable networking opportunities and skill development, especially for students and data scientists. By contributing, you can make a lasting impact on projects you care about. Are you ready to join a sprint and potentially change the trajectory of your career? Read on to find out more!
Read on... (570 words, approximately 3 minutes reading time)