Eric J Ma's Website

nxviz 0.5 released!

written by Eric J. Ma on 2018-08-01 | tags: nxviz visualization data science software open source

A new version of nxviz has been released!

In this update, I have added a declarative interface for visualizing geographically-constrained graphs. Here, nodes in a graph have their placement constrained by longitude and latitude.

An example... (read more)

(117 words, approximately 1 minute reading time)

pyjanitor 0.3 released!

written by Eric J. Ma on 2018-07-27 | tags: open source pyjanitor data science

A new release of pyjanitor is out!

Two new features that I have added include:

  1. Concatenating column names into a single column, such that each item is separated... (read more)

(135 words, approximately 1 minute reading time)

SciPy 2018

written by Eric J. Ma on 2018-07-26 | tags: scipy conferences python

It's been about two weeks since SciPy 2018 ended, and I've finally found some breathing room to write about it.

SciPy 2018 is the 4th year I've made it to the conference, my first one being SciPy 2015 (not 2014, as I had originally... (read more)

(572 words, approximately 3 minutes reading time)

Bayesian Estimation, Group Comparison, and Workflow

written by Eric J. Ma on 2018-07-16 | tags: bayesian statistics data science

Over the past year, having learned about Bayesian inference methods, I finally see how estimation, group comparison, and model checking build upon each other into this really elegant framework for data analysis.

(read more)

(457 words, approximately 3 minutes reading time)

ECDFs

written by Eric J. Ma on 2018-07-14 | tags: statistics visualization data science

I detail why ECDFs are superior to histograms as a way of visualizing distributions. In short, they provide richer information than histograms do. Come learn about them!
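As a rough illustration of the idea (not the code from the post itself), here is a minimal sketch of how an ECDF can be computed with only the standard library. The function name `ecdf` and the sample values are my own placeholders. Every data point gets its own position on the curve, so nothing is hidden by a choice of bin width, which is one way an ECDF carries more information than a histogram.

```python
def ecdf(data):
    """Return the sorted values and the cumulative fraction at each value.

    The i-th smallest point (1-indexed) is paired with i / n, i.e. the
    fraction of observations less than or equal to it.
    """
    xs = sorted(data)
    n = len(xs)
    ys = [(i + 1) / n for i in range(n)]
    return xs, ys


# Hypothetical sample data, purely for illustration.
samples = [2.0, 3.5, 1.0, 3.5, 5.0]
xs, ys = ecdf(samples)

for x, y in zip(xs, ys):
    print(f"value={x}, cumulative fraction={y:.2f}")
```

Plotting `xs` against `ys` as points or as a step function gives the ECDF; percentiles can be read directly off the y-axis.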

Read on... (609 words, approximately 4 minutes reading time)

Git Tip: Apply a Patch

written by Eric J. Ma on 2018-06-17 | tags: git version control code snippets

I learned a new thing this weekend: we apparently can apply a patch onto a branch/fork using git apply [patchfile].

There are a few things to unpack here. First off, what's a patchfile?

The long story cut short... (read more)

(729 words, approximately 4 minutes reading time)

My Latent Dissatisfaction with Modern ML

written by Eric J. Ma on 2018-06-05 | tags: data science machine learning deep learning causal inference graph theory probability

It took reading Judea Pearl's "The Book of Why", and Jonas Peters' mini-course on causality, for me to finally figure out why I had this lingering dissatisfaction with modern machine learning. It's because modern machine learning (deep learning... (read more)

(662 words, approximately 4 minutes reading time)

Causal Modelling

written by Eric J. Ma on 2018-05-26 | tags: causal inference

Finally, I have finished Judea Pearl's latest work "The Book of Why"! Having read it, I have come to appreciate how much... (read more)

(208 words, approximately 2 minutes reading time)

Model Baselines Are Important

written by Eric J. Ma on 2018-05-06 | tags: machine learning data science deep learning automl

For any problem that we think is machine learnable, having a sane baseline is really important. It is even more important to establish one early.
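To make "sane baseline" concrete, here is a small sketch of one common choice for classification: always predict the most frequent training label. This is my own illustration rather than anything from the post, and the names `majority_baseline` and `accuracy` and the toy labels are hypothetical. Any fancier model should at least beat this number.

```python
from collections import Counter


def majority_baseline(train_labels):
    """Build a predictor that always returns the most common training label."""
    most_common_label, _ = Counter(train_labels).most_common(1)[0]

    def predict(inputs):
        # The inputs are ignored on purpose: this is the "no learning" floor.
        return [most_common_label for _ in inputs]

    return predict


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical toy data, purely for illustration.
train_labels = ["spam", "ham", "ham", "ham"]
test_inputs = ["msg a", "msg b", "msg c"]
test_labels = ["ham", "spam", "ham"]

predict = majority_baseline(train_labels)
baseline_acc = accuracy(test_labels, predict(test_inputs))
print(f"Majority-class baseline accuracy: {baseline_acc:.2f}")
```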

Today at ODSC, I had a chance to meet both Andreas Mueller and Randy Olson. Andreas leads (read more)

(218 words, approximately 2 minutes reading time)

Consolidate your scripts using click

written by Eric J. Ma on 2018-03-30 | tags: programming code snippets scripting python data science

Overview

click is amazing! It's a Python package that allows us to add a command-line interface (CLI) to our Python scripts easily. This blog post is a data scientist-oriented post on how we can use... (read more)
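For a flavour of what this can look like, here is a minimal sketch that assumes "consolidating" means grouping several small scripts as subcommands of one entry point. That reading is my assumption, and the command names `greet` and `total` are hypothetical placeholders. It needs the third-party `click` package installed.

```python
import click
from click.testing import CliRunner


@click.group()
def cli():
    """One entry point that collects related scripts as subcommands."""


@cli.command()
@click.option("--name", default="world", help="Who to greet.")
def greet(name):
    """Print a greeting."""
    click.echo(f"Hello, {name}!")


@cli.command()
@click.argument("numbers", nargs=-1, type=float)
def total(numbers):
    """Sum the numbers given on the command line."""
    click.echo(sum(numbers))


# In a real script you would end the file with a call to cli() under the
# usual `if __name__ == "__main__":` guard. Here the commands are exercised
# in-process with click's test runner instead.
runner = CliRunner()
greet_result = runner.invoke(cli, ["greet", "--name", "Eric"])
total_result = runner.invoke(cli, ["total", "1", "2"])
print(greet_result.output, end="")
print(total_result.output, end="")
```

Saved as a script with the `cli()` call added, this could be run as `python script.py greet --name Eric` or `python script.py total 1 2`, and `--help` works on the group and on each subcommand.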

(451 words, approximately 3 minutes reading time)