Projects

Highlights from recent roles, then previous work and the archive.

Highlights

NAVER Cloud

MLX Data Manager

A data management system designed for machine learning workloads, offering a Hugging Face-compatible interface for ease of adoption, along with data version control and lineage tracking.

Data, ML, Kotlin, Python
Details soon →
NAVER

Hand-written Font Generator

Given a sheet of handwriting, generates a font that resembles it.

ML, Kotlin
Coupang

Data Lake

Ingested and aggregated operational data from Kafka into S3 (ORC format) for analytics, ensuring integrity through deduplication and exactly-once delivery semantics.

Java, Kafka, S3

Previous Work

Archive

Older work and side projects

volunteer

BOINC

Contributing compute to BOINC, on the order of 127 GFLOPS on average, for climate modeling, prime search, stellar-stream simulations, and similar projects.
distributed computing, science
Visit →
open source

Interview Questions

Curated a collection of interview questions and solutions from various tech companies, for personal reference and to support fellow engineers preparing for interviews.
Python, Go, reference
Visit →
personal

Coding Expedition

Writing code away from home: cafés, road trips, and a map of the places visited. The longer story and the map are on this page.
JavaScript, travel, maps
Visit →
open source

Better Translator

A language translator that uses Japanese as an intermediate language. Processed more than 20,000 translation requests per day (as of Sep 2013).
Python, JavaScript, application, web
Visit →
proprietary

Smartrek

An ambitious project to solve urban traffic congestion problems. My primary responsibility on the team was to build a scalable system on the Amazon EC2 platform and to build in-house software for data analysis.
Java, application, transportation, big data
Visit →
open source

Winnowing

A Python implementation of Winnowing, a local algorithm for document fingerprinting.
Python, algorithm, document fingerprinting
Visit →
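For readers unfamiliar with the technique, here is a minimal Python sketch of winnowing as described by Schleimer, Wilkerson, and Aiken. It is not this project's code: the function names, the SHA-1-based k-gram hash, and the defaults k=5 and w=4 are assumptions chosen for illustration.

```python
import hashlib


def kgram_hashes(text, k):
    """Hash every k-character substring (k-gram) of text."""
    # Illustrative hash: first 32 bits of SHA-1. A rolling hash is the
    # usual choice in practice; this one just keeps the sketch short.
    return [
        int(hashlib.sha1(text[i:i + k].encode("utf-8")).hexdigest()[:8], 16)
        for i in range(len(text) - k + 1)
    ]


def winnow(text, k=5, w=4):
    """Return (hash, position) fingerprints chosen by winnowing.

    Each window of w consecutive k-gram hashes contributes its minimum
    hash; ties go to the rightmost occurrence, and a position is
    recorded only once even if it wins several windows in a row.
    """
    hashes = kgram_hashes(text, k)
    fingerprints = []
    last_pos = -1
    # Texts too short to fill one window yield no fingerprints.
    for start in range(max(len(hashes) - w + 1, 0)):
        window = hashes[start:start + w]
        smallest = min(window)
        # Index of the rightmost occurrence of the minimum in the window.
        offset = w - 1 - window[::-1].index(smallest)
        pos = start + offset
        if pos != last_pos:
            fingerprints.append((smallest, pos))
            last_pos = pos
    return fingerprints
```

Two documents can then be compared by intersecting the hash values of their fingerprints. Any shared substring of at least w + k - 1 characters is guaranteed to contribute at least one common fingerprint.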
proprietary

ChemBook

An iOS app to help students with writing chemistry lab reports.
Objective-C, Python, application, chemistry
open source

Radioactivity

Radioactivity started in March 2011 to help alert the general public to the spread of radiation and radioactive materials from the catastrophic incident at the Fukushima nuclear power plant in Japan, by providing visual information on city-wise radiation levels.
Python, JavaScript, application, web, data visualization
Visit →
open source

gdocsdb

A Python library to use Google Spreadsheet as a database.
Python, library, database, open source
Visit →
proprietary

Kindleholic

An RSS/Atom feed aggregation service for Amazon Kindle.
Python, JavaScript, application, web, ebook
Visit →
academic

Teach Ourselves

A project to create a society for students in which learning and creativity are rewarded with points that can be redeemed for goods.
ActionScript, application, web
Visit →
academic

Bootstrapped Learning

A DARPA-sponsored project to develop an electronic student that can learn from natural human instruction, close to the way humans do.
Java, C++, machine learning, interdisciplinary
classified

maps

A project to reconstruct a map at global scale by collecting map tiles from various sources such as Google Maps and OpenStreetMap.
Python, big data, data mining
Visit →
open source

Readown

A Markdown reader for Mac OS X.
Objective-C, application, mac, open source
Visit →
proprietary

Violet

A children's book for iPad with interactive animations.
Objective-C, application, mobile
proprietary

CQube

A labyrinth-like puzzle game for iPhone.
Objective-C, application, mobile, game
academic

Mandelbrot set

A distributed processing project to draw a Mandelbrot set using multiple computers.
Java, C, application
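As a rough illustration of how such a rendering can be divided among machines, here is a minimal Python sketch. It is not the original Java/C code: the function names, the viewport bounds, and the row-band partitioning are assumptions. Each band of rows depends only on its own coordinates, so bands can be computed independently and concatenated.

```python
def escape_time(c, max_iter=100):
    """Count iterations of z -> z*z + c before |z| exceeds 2."""
    z = 0j
    for n in range(max_iter):
        if abs(z) > 2.0:
            return n
        z = z * z + c
    return max_iter  # never escaped: treated as inside the set


def render_rows(row_start, row_end, width, height, max_iter=100):
    """Render rows [row_start, row_end) of a width x height image.

    Assumed viewport: real axis [-2, 1], imaginary axis [-1.5, 1.5].
    Requires width > 1 and height > 1.
    """
    rows = []
    for y in range(row_start, row_end):
        im = -1.5 + 3.0 * y / (height - 1)
        row = []
        for x in range(width):
            re = -2.0 + 3.0 * x / (width - 1)
            row.append(escape_time(complex(re, im), max_iter))
        rows.append(row)
    return rows


def split_rows(height, workers):
    """Split the image into contiguous row bands, one per worker."""
    step = -(-height // workers)  # ceiling division
    return [(s, min(s + step, height)) for s in range(0, height, step)]
```

In a distributed setup, each `(row_start, row_end)` pair from `split_rows` would be sent to a different machine, and the returned bands joined in order to form the full image.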