Blog - Eric J. Ma's Personal Site

Keys to effective collaborative data science

written by Eric J. Ma on 2024-10-18 | tags: data science science collaboration standardization empathy automation reproducibility reviews coding research

In this blog post, I discuss enhancing collaboration between data scientists and non-computational colleagues to ensure impactful work. I emphasize the importance of standardizing code initialization, involving non-coders in code reviews, empathetic communication across roles, and ensuring work is runnable on any computer. These practices aim to bridge gaps, reduce cognitive load, and foster mutual appreciation of challenges faced in data science projects. How can these strategies be implemented in your team?

Read on... (1626 words, approximately 9 minutes reading time)

What brings you joy at work?

written by Eric J. Ma on 2024-10-09 | tags: joy standardization mentorship building speed gatherings puzzles investment conversations standardization

In this blog post, I explore what brings me joy at work, highlighting my passion for setting high standards, mentoring, building tools, and engaging in meaningful projects at Moderna. I discuss the satisfaction derived from creating efficient solutions and the thrill of solving complex problems. Conversely, I touch on aspects of my job that are less joyful, like operational tasks and unnecessary complexity, which I strive to minimize through automation and effective processes. What brings you joy at work, and what removes joy from work for you?

Read on... (1238 words, approximately 7 minutes reading time)

Building Pigeon Secure Notes in under 15 minutes of coding

written by Eric J. Ma on 2024-10-06 | tags: ai programming productivity software development automation cryptography

In this post, I detail my experience building Pigeon Secure Notes, a secure note-sharing app, in under 15 minutes of coding time. I highlight the use of AI-assisted coding tools like Cursor and Claude Sonnet, discuss the challenges faced during development and deployment, and reflect on the benefits and limitations of AI-assisted programming for experienced developers. The post also touches on the importance of practical wisdom in software development and the thrill of building at "the speed of thought."

Read on... (1276 words, approximately 7 minutes reading time)

Explain your Jupyter notebooks using LlamaBot

written by Eric J. Ma on 2024-09-27 | tags: jupyter notebooks llm automation documentation productivity data science python llamabot

In this blog post, I introduce a new tool that uses LLMs to automatically generate explanations for Jupyter notebook cells. I discuss the motivation behind creating this tool, its current capabilities and limitations, and provide recommendations for optimal usage. The post also includes instructions on how to install and try out the tool for yourself. Curious about how it can streamline your data science projects? Why not give it a try?

Read on... (484 words, approximately 3 minutes reading time)

Recreating Shortwhale with AI-Assisted Coding

written by Eric J. Ma on 2024-09-23 | tags: coding email htmx sqlite fastapi cloudmailin digital ocean deployment web development ai coding

In this blog post, I share my experience recreating Dan Ariely's Shortwhale using a tech stack that includes HTMX, SQLite, FastAPI, CloudMailin, and DigitalOcean. I highlight the transformative role of AI-assisted coding with Cursor, which allowed me to build core functionality in under two hours. The project, now live, was an experiment and a learning opportunity, emphasizing the speed and ease of AI-assisted web development. Curious about how AI can accelerate your web projects?

Read on... (519 words, approximately 3 minutes reading time)

How to set up Pixi with CodeArtifacts

written by Eric J. Ma on 2024-09-19 | tags: pixi aws codeartifacts package management devops python configuration

In this blog post, I share my experience integrating Pixi with AWS CodeArtifact, detailing the steps needed to configure Pixi for internal package publishing at work. I discuss the installation of pipx and keyrings.codeartifact, editing keyring configurations, and setting up Pixi's global configuration. The guide aims to help others overcome similar integration challenges (obviously without revealing company-specific details). Curious about how these configurations can streamline your development process?

Read on... (945 words, approximately 5 minutes reading time)

Sync GitHub secrets with your .env and gh CLI

written by Eric J. Ma on 2024-09-15 | tags: github secrets environment-variables gh-cli automation devops productivity security til

Today, I learned that we can easily sync our local .env file with GitHub secrets using the GitHub CLI (gh). This method is much faster and less error-prone than manually entering secrets through the web interface. Curious to see how it works?

Read on... (206 words, approximately 2 minutes reading time)

Cursor is a jetpack for coders

written by Eric J. Ma on 2024-09-14 | tags: coding productivity ai cursor developertools ide automation programming efficiency innovation

In this post, I explore how Cursor, an AI-powered IDE, transformed my coding workflow and supercharged my productivity. Learn about its standout features and why it's become my secret weapon for efficient development and writing. Are you ready to revolutionize your coding experience?

Read on... (885 words, approximately 5 minutes reading time)

On writing LLM evals in pytest

written by Eric J. Ma on 2024-09-06 | tags: evaluations pytest documentation automation testing validation changes criteria staleness

In this blog post, I explore the process of writing evaluations for LLM systems using pytest, aiming to move beyond subjective assessments to more structured testing. I detail the creation of specific tests to assess if LLMs can accurately determine documentation staleness, using various models and criteria. The challenges and insights gained from setting up these evaluations reveal the complexities involved in ensuring that LLMs perform as expected. Could this method enhance the reliability of your LLM evaluations?

Read on... (1739 words, approximately 9 minutes reading time)

LlamaBot now has StructuredBot!

written by Eric J. Ma on 2024-08-31 | tags: structured generation llamabot python documentation llm pydantic software development testing structuredbot technology

In this blog post, I discuss the latest updates to LlamaBot, particularly focusing on the StructuredBot feature introduced by Elliot Salisbury. StructuredBot leverages JSON mode of LLMs for structured outputs, significantly simplifying the process of generating reliable and type-safe outputs without manual string parsing. I illustrate its application in an automated documentation checker and writer, enhancing productivity by integrating LLM-based and traditional programming methods. Curious about how StructuredBot can streamline your documentation process?

Read on... (1272 words, approximately 7 minutes reading time)

Eric J Ma's Website