written by Eric J. Ma on 2018-07-14 | tags: statistics visualization data science
I detail why ECDFs are superior to histograms as a way of visualizing distributions. In short, they provide richer information than histograms do. Come learn about them!
Read on... (611 words, approximately 4 minutes reading time)written by Eric J. Ma on 2018-06-17 | tags: git version control code snippets
I learned a new thing this weekend: we apparently can apply a patch onto a branch/fork using git apply [patchfile]
.
There's a few things to unpack here. First off, what's a patchfile
?
The long story cut short... (read more)
(729 words, approximately 4 minutes reading time)written by Eric J. Ma on 2018-06-05 | tags: data science machine learning deep learning causal inference graph theory probability
It took reading Judea Pearl's "The Book of Why", and Jonas Peters' mini-course on causality, for me to finally figure out why I had this lingering dissatisfaction with modern machine learning. It's because modern machine learning (deep... (read more)
(662 words, approximately 4 minutes reading time)written by Eric J. Ma on 2018-05-26 | tags: causal inference
Finally, I have finished Judea Pearl's latest work "The Book of Why"! Having read it, I have come to appreciate... (read more)
(208 words, approximately 2 minutes reading time)written by Eric J. Ma on 2018-05-06 | tags: machine learning data science deep learning automl
For any problem that we think is machine learnable, having a sane baseline is really important. It is even more important to establish them early.
Today at ODSC, I had a chance to meet both Andreas Mueller and Randy Olson. Andreas leads
written by Eric J. Ma on 2018-03-30 | tags: programming code snippets scripting python data science
click
is amazing! It's a Python package that allows us to add a command-line interface (CLI) to our Python scripts easily. This blog post is a data scientist-oriented post on how we can use click
to build...
(read more)
written by Eric J. Ma on 2018-02-28 | tags: data science deep learning message passing neural networks software engineering graph theory
At work, I’ve been rolling my own deep learning package to experiment with graph convolutional neural networks. I did this because in graph-centric deep learning, an idea I picked up from this paper, the inputs, convolution kernels, and much more,... (read more)
(657 words, approximately 4 minutes reading time)written by Eric J. Ma on 2018-02-26 | tags: teaching education datacamp
It always brings me joy to see others benefit from what I can offer.
Thanks for sharing the fruits of your journey on LinkedIn, Umar!
Also a big... (read more)
(103 words, approximately 1 minute reading time)written by Eric J. Ma on 2018-02-25 | tags: programming python testing software engineering
In this blog post, let me share with you how we can selectively run a test using pytest
's machinery.
written by Eric J. Ma on 2018-02-21 | tags: software engineering data visualization nxviz graph theory data science
In celebration of the first pull request that the nxviz
project has received!