Use Cases

Semantic SegmentationAttention MechanismsMulti-Modal FusionRemote SensingPyTorch Lightning

↗

Flood segmentation from multispectral satellite imagery (ProCANet)

IEEE Geoscience and Remote Sensing Letters (GRSL), 2025, Q1 (IF 4.4) · First Author

Satellite flood maps for 25M people, hours instead of days, state-of-the-art IoU.

PyTorchPyTorch LightningGoogle Earth EngineSentinel-2PlanetScope +4

Read writeup →

Multi-Modal FusionSemantic SegmentationSARDEMFoundation ModelsLand Use

↗

Mining footprint detection with multi-modal satellite data

Remote Sensing of Environment, vol. 318, 2025, Q1 (IF 11.4) · Co-Author

The first geospatial foundation model applied to continental-scale mining footprint detection.

TensorFlowPyTorchGoogle Earth EngineSentinel-1Sentinel-2 +10

Read writeup →

Object DetectionCoastal MonitoringChange DetectionTime-Series SatellitePatent

Aquaculture pond detection and change analysis

Filed Patent IDS000010594, Issued June 2025 · Lead Developer

Catching illegal shrimp farms from orbit before regulators arrive, hours, not field-survey years.

PythonGoogle Earth EngineNDWIMNDWIAWEI +1

Read writeup →

Flood MitigationPolicy EvaluationNDWIMixed MethodsUrban Resilience

↗

Flood policy evaluation, retention pond effectiveness in South Bandung

AQUA, Water Infrastructure, Ecosystems and Society, 2025, Q2 · Contributor

Do retention ponds actually reduce flooding? We answered it with satellites and causal stats.

PythonGoogle Earth EngineNDWIDeep learning segmentationmixed-methods policy analysis

Read writeup →

Vision-Language ModelsMultilingual AIAI SafetyActivation SteeringCross-Modal ConflictApart ResearchHackathon

↗

Do Multilingual VLMs Abstain Under Cross-Modal Conflict in Low-Resource Languages?

Apart Research Global South AI Safety Hackathon (2026) · Co-First Author, Benchmark & Steering Lead

A model that trusts its eyes in English becomes a caption-follower in Telugu. We measured it, localized it, and steered it back.

InternVL3-2B/8BQwen2.5-VL-3B/7BQwen3-VL-8BLLaVA-OneVision-7BGLM-4.1V-9B-Thinking +16

Read writeup →

Vision-Language ModelsBenchmarkDatasetCrowdsourcingCultural AIACL 2025

↗

SEA-VL: multicultural vision-language benchmark for Southeast Asia

ACL 2025 (Main Conference, Long Paper) · Core Contributor, Data Pipeline Lead

GPT-4V fails on Southeast Asian culture. We built the benchmark that proves it, ACL 2025.

GPT-4VGemini 1.5Claude 3LLaVAInternVL +69

Read writeup →

VLMsCultural AIRegional AdaptationDiffusion ModelsKnowledge DistillationUnder Review

↗

GG-EZ: regional adaptation framework for vision-language models in SEA

Under Review + arXiv preprint 2026 · Lead, Diffusion Model Arm

Cultural AI for Southeast Asia without retraining from scratch, 98%+ global quality retained.

SDXLLLaVAInternVLPhi-VisionPyTorch +4

Read writeup →

NLPLanguage IdentificationLow-ResourceBenchmarkACL 2026

↗

CommonLID: language identification on noisy web data

ACL 2026 · Contributor

That 99% LID accuracy on SEA languages? It collapses on real noisy web data.

fastText LangIDGlotLIDOpenLIDCommonCrawlLow-resource SEA languages (11+) +41

Read writeup →

ForecastingConformal PredictionXGBoostGCP Vertex AIdbtMLOpsUncertainty Quantification

Share of Voice forecasting system, Fortune 500 APAC (Artefact)

Artefact, Senior Data Scientist (Feb 2025, Nov 2025) · Lead ML Engineer (Founding Technical Member, Jakarta Office)

Turned media gut-feel into calibrated prediction intervals across 6 APAC markets for a Fortune 500.

XGBoostConformal Prediction (split)GCP Vertex AIBigQueryGCS +12

Read writeup →

BiometricsFace VerificationAnti-SpoofingLiveness DetectionCredit ScoringFintechMLOpsRegulatory Compliance

Biometric authentication and alternative credit scoring at scale (GDP Labs)

GDP Labs (GLAIR.ai), Senior Data Scientist / ML Engineer (2021, 2023) · Lead ML Engineer, Promoted to Senior within 12 months

1M+ daily inferences at 99.99% uptime. Credit for the borrowers banks wouldn't score.

PyTorchscikit-learnXGBoostIntel OpenVINOMobileNet +14

Read writeup →

Fraud DetectionAnomaly DetectionGraph NetworksReal-Time InferenceKafkaRedisFintech

Real-time fraud detection pipeline (GDP Labs)

GDP Labs (GLAIR.ai), Lead ML Engineer (2021, 2023) · Lead ML Engineer

Three fraud attack types, one pipeline, rule filters, XGBoost, and graph anomaly detection.

XGBoostgradient boostinggraph anomaly detectionApache KafkaRedis +5

Read writeup →

Time Series ForecastingSmart CityProphetLSTMCausal InferencePolicy Analysis

Municipal waste logistics forecasting, Jakarta Smart City

Jakarta Smart City, Data Scientist (Jan 2021, June 2021) · Lead Data Scientist

Forecasted Jakarta's trash so trucks follow where waste actually piles up instead of a fixed weekly schedule.

PythonRFacebook ProphetARIMASARIMA +9

Read writeup →

Demand ForecastingClient DeliveryConsultingFinancial ServicesMedia

Demand and audience forecasting for financial and media clients (Artefact)

Artefact, Senior Data Scientist (2025) · Lead Data Scientist

Two Fortune 500 clients, two forecasting problems, two weeks each, shipped and handed off.

GCP (Vertex AIBigQuery)PythonXGBoostProphet +2

Read writeup →

Gradient BoostingGeospatial ClusteringDemand PredictionAzure MLReal-TimeHackathon

HakkTaxi: ride-share demand prediction (Microsoft Azure APAC Hackathon)

Microsoft Azure Virtual Hackathon APAC, Regional Champion (2020) · Lead ML Engineer

A Jakarta ride-demand heatmap built in 48 hours. Microsoft Azure APAC Regional Champion.

Microsoft Azure (Azure MLAzure Maps)XGBoostgradient boostingH3 grid +4

Read writeup →

Edge AIHealthcareComputer VisionGDPR ComplianceLow BandwidthHackathon

TeleHealthMonitor: Edge AI for remote patient monitoring (CamvsCovid)

CamvsCovid, Cambridge Judge Business School, Top 3 Globally (2020) · Lead ML Engineer

COVID-19 vitals monitoring on a 2G connection, no cloud, no smartphone. Top 3, Cambridge.

On-device (ARM CPU)optical flowpose estimationModel quantization (INT8/FP16)OpenCV +4

Read writeup →

NLPSpeech RecognitionIVRAccessibilityLow-ResourceHackathon

Community IVR: Voice AI for offline communities (Cal Hacks 8.0)

Cal Hacks 8.0, UC Berkeley, Best Community Track (2020) · Lead ML Engineer

Health info for 40% of Indonesians without smartphones, via a plain old phone call.

ASRTTSDTMF IVRKnowledge graphintent classification +50

Read writeup →

NLPCausal InferenceDifference-in-DifferencesPolicy AnalysisIEEESmart City

↗

Plastic bag ban, causal policy analysis using NLP and citizen data

ICISS 2021, IEEE · First Author

Jakarta banned plastic bags in 2020. Did it work? Proved it causally with 100K+ complaints.

Text classificationcomplaint taxonomyDifference-in-differenceshypothesis testingJAKI +11

Read writeup →

Vision-Language ModelsLoRA Fine-tuningQwen2.5-VLLLaMA-FactoryDeepSpeedVLM Training

Qwen VL Fine-tuning for AI City Challenge 2026 Track 2

AI City Challenge 2026, Track 2 (2026) · Lead ML Engineer

Fine-tuning a 3B VLM on a 14GB GPU, every memory constraint hit, diagnosed, and solved.

Qwen2.5-VL-3B-InstructLoRALLaMA-FactoryDeepSpeed ZeRO-2PyTorch 2.5.1+cu121 +4

Read writeup →

Conformal PredictionVision-Language NavigationUncertainty QuantificationVLN-DUETVLN-HAMTScore FunctionsICRA 2027

Conformal Prediction for Vision-and-Language Navigation (DUET + HAMT)

Thesis Research, VLN-DUET Uncertainty Propagation (2026) · Lead Researcher

21 uncertainty methods for robot navigation tested. Most fail. One works, and we know why.

PyTorchDeepSpeed ZeRO-2VLN-DUETVLN-HAMTTHR +75

Read writeup →

Open Source ContributionKubernetesGoInference SchedulingGateway APIEnvoyP/D DisaggregationKV Cache

Contributing to llm-d/inference-scheduler, Kubernetes LLM Inference Scheduling

Open Source, llm-d/inference-scheduler (Planned 2026) · Contributor

Contributing to the Kubernetes layer routing LLM requests across GPU backends at production scale.

Go 1.24+Gateway APIEnvoy ext-procKustomizeHelm +11

Read writeup →