Eric J Ma's Website

« 17 18 19 20 21 »

Mathematical Intuition

written by Eric J. Ma on 2018-12-09 | tags: deep learning bayesian math data science

Last week, I picked up Jeremy Kun's book, "A Programmer's Introduction to Mathematics". In it, I finally found an explanation for my frustrations when reading math papers:

What programmers would consider "sloppy" notation is one... (read more)

(775 words, approximately 4 minutes reading time)
Solving Problems Actionably

written by Eric J. Ma on 2018-11-13 | tags: data science insight data science

There's a quote by John Tukey that has been a recurrent theme at work.

It's better to solve the right problem approximately than to solve the wrong problem exactly.

Continuing on the theme of quoting two... (read more)

(326 words, approximately 2 minutes reading time)
Thoughts on Black

written by Eric J. Ma on 2018-11-12 | tags: python code style

Having used Black for quite a while now, I have a hunch that it will continue to surpass its current popularity amongst projects.

It's one thing to be opinionated about things that matter for a project, but don't matter personally. Like code... (read more)

(181 words, approximately 1 minute reading time)
Bayesian Modelling is Hard Work!

written by Eric J. Ma on 2018-11-07 | tags: bayesian data science statistics

It’s definitely not easy work; anybody trying to tell you that you can "just apply this model and just be done with it" is probably wrong.

Simple Models

Let me clarify: I agree that doing the first half of the... (read more)

(1011 words, approximately 6 minutes reading time)
More Dask: Pre-Scattering Data

written by Eric J. Ma on 2018-10-26 | tags: dask parallel data science optimization gridengine

I learned a new thing about dask yesterday: pre-scattering data properly!

Turns out, you can pre-scatter your data across worker nodes, and have them access that data later when submitting functions to the scheduler.

(read more)

(515 words, approximately 3 minutes reading time)
Parallel Processing with Dask on GridEngine Clusters

written by Eric J. Ma on 2018-10-11 | tags: parallel dask gridengine data science optimization

I recently just figured out how to get this working... and it's awesome! :D

Motivation

If I'm developing an analysis in the Jupyter notebook, and I have one semi-long-running function (e.g. takes dozens of seconds)... (read more)

(1988 words, approximately 10 minutes reading time)
Optimizing Block Sparse Matrix Creation with Python

written by Eric J. Ma on 2018-09-04 | tags: graph optimization numba python data science sparse matrix

Introduction

At work, I recently encountered a neat problem. I'd like to share it with you all.

One of my projects involves graphs; specifically, it involves taking individual graphs and turning them into one big... (read more)

(1216 words, approximately 7 minutes reading time)
3D Printed WiFi Access QR Codes: Part 2

written by Eric J. Ma on 2018-09-02 | tags: 3d printing python qr code

Part 2 of how to create 3D-printed QR codes!

Read on... (842 words, approximately 5 minutes reading time)
3D Printed WiFi Access QR Codes: Part 1

written by Eric J. Ma on 2018-09-01 | tags: python qr codes wifi 3d printing security coding technology diy networking home project guest network pyqrcode pypng solidpython numpy click flask

Over the weekend, I embarked on a cool DIY project to create a 3D printed QR code for our guest WiFi network.

I used Python and a few packages to generate the QR code. It was a fun, practical project that also satisfied my curiosity about QR codes.

Stay tuned for more! πŸ˜„πŸ‘¨β€πŸ’»πŸ“Ά

Read on... (658 words, approximately 4 minutes reading time)
Joint, conditional, and marginal probability distributions

written by Eric J. Ma on 2018-08-07 | tags: statistics probability bayesian data science

A little cheat sheet to help you remember what joint, conditional, and marginal distributions are.

Read on... (87 words, approximately 1 minute reading time)
« 17 18 19 20 21 »