Suppose you have a huge number of items that you would like to group together by a fuzzy notion of similarity. Suppose the only tool available to you is a key-value store. Suppose you only have the resources to consider each item once. Never fear, MinHashing is here!

# Ranking Servers for Natural Selection 2

The next version of Natural Selection 2 contains an algorithm I designed for ranking servers. It powers the “play now” button at…

# A Skill Ranking System for Natural Selection 2

Inferring player skill in a complicated team-based game.

# How to quickly clean up a photo of a drawing using GIMP and Physics

Using a sketch I made of my wife when we were dating, I’m going to show how you can get an image suitable for printing or coloring out of a mediocre photograph.

# How Karma Systems Should Work: The Beta-Binomial

How to Aggregate Fractions the Right Way. Fitting a Beta-Binomial to a Collection of Binomial Observations.

# Logs, Tails, Long Tails

Learn the shape of the logs of probability distributions, and you’ll be able to guess how they behave.

# A Short, Simple Introduction to Information Theory

Information theory is one of the most useful parts of statistics for computer scientists to understand, and it’s deeply related to things we do every day. If you are comfortable writing code but sometimes intimidated by math, this should give you an overview of the main concepts in information theory, and hopefully an understanding of why they work.

# How to Backpack

Everything you need to know to enjoy a week in the outdoors and be safe doing it.

# Arranging Music for A Cappella

This guide should get you started arranging music for your a cappella group if you have musical ideas but don’t know how to put them down on paper. It’s no substitute for really learning music theory, but if your primary concern is just arranging your favorite song, this will help get you there.