Curated selection of code I've written.
- Merged PR to scikit-learn: Changed default behavior of ROC curve computation to work better with large datasets by dropping unnecessary thresholds, and implemented new tests for the new behavior.
- Merged PR to scipy: Replaced an O(N^2) algorithm for mode with an O(N) one, with benchmarks and unit tests.
- Solution to Project Euler 517: Uses dynamic programming and a few arithmetic algorithms to solve this problem.
- WIP Attempt for Project Euler 514: Uses stratified sampling to estimate the expected value of a complex distribution, attempt to solve this problem.
- Churchill Navigation Challenge: Implements building and then searching a kd-tree for the top
k
points within a given rectangle as part of a programming challenge. Details of implementation in link.
- Statistical Interactive Explorer of Vaccine Efficacy: Final project for data visualization CS class, visualizations to aid in statistical analysis of vaccine studies, with a publication in preparation. Current product here. I implemented selection in overview, stacked bar charts, box charts, among others.
- Fremont Bicycle Count Visualization: Assignment for same class, final product here. I implemented the selection controls and data manipulation parts (which refilter and reaggregate upon changing selection).
- Solution to Project Euler 517: A Scala translation of the Python example above.