Projects

Highlights from recent roles, then other workplace projects and the archive.

Highlights

NAVER Cloud

MLX Data Manager

A data management system designed for machine learning workloads, offering a Hugging Face-compatible interface for ease of adoption, along with data version control and lineage tracking.

NAVER

Hand-written Font Generator

Given a sheet of handwritten paper, generate a font that resembles the handwriting.

Coupang

Data Lake

Ingested and aggregated operational data—from Kafka into S3 (ORC format)—for analytics, ensuring integrity with deduplication and exactly-once delivery semantics.

Previous Works

Archive

Older work and side projects

open source

Base62

A Python library to encode and decode any arbitrary data in base62 (duosexagesimal; using 0-9, A-Z, and a-z) for URL-safety. I was recently able to convince my colleagues to use this library for work.
Visit →
proprietary

Smartrek

An ambitious project to solve urban traffic congestion problems. My primary responsibility in the team is to build a scalable system with Amazon EC2 platform, and to build in-house software for data analysis.
Visit →
open source

Radioactivity

The project radioactivity had started in March of 2011 in an attempt to help the general public to be alerted regarding the spread of radiation and other radioactive materials originated from the catastrophic incident in the Fukushima nuclear power plant in Japan, by providing visual information of city-wise radition levels.
Visit →
academic

Bootstrapped Learning

A DARPA sponsored project to develop an electronic student that can learn from natural human instruction close to the way humans do.
academic

Mandelbrot set

A distributed processing project to draw a mandelbrot set using multiple computers.