Blog
Data Science & ML

Handy Dandy Guide to Working With Timestamps in pandas
Working with times in pandas can be tricky; this blog post covers the most common problems.

Computer Vision at Scale With Dask and PyTorch
This tutorial walks through how to use PyTorch and Dask to train an image recognition model across a GPU cluster.

Cross-Entropy Loss Function
When working on a machine learning or deep learning problem, loss/cost functions are used to optimize the model during training. The …

Random Forest on GPUs: 2000x Faster than Apache Spark
This blog post compares RAPIDS and Dask with Apache Spark for model training.

Supercharging Hyperparameter Tuning with Dask
The distributed computing framework Dask is great for hyperparameter tuning, since you can train different parameter sets concurrently.

Practical Issues Setting up Kubernetes for Data Science on AWS
Data science has unique workflows that don't always match those of software engineers and require special setup for Kubernetes.