AI & Data Science Professional

Creating Business Value
with AI Solutions

With 21 years of development experience and 6 years of AI/ML expertise, I design and build enterprise-grade AI systems.

21+
Years Experience
6+
Years AI/ML
9+
AI Services Launched
90%
Automation Rate
SJ

Seo Jeong-hwa

AI Architect & Team Lead

gum798@gmail.com
github.com/gum798
Incheon, South Korea

About Me

"We build what we envision."

From automotive, security, to fintech, I now specialize in AI/ML System Architecture. I focus on implementing AI services that create real business value beyond simple technology application.

I hold a Master's degree in Data Science & AI from Sogang University (4.27/4.5), and have successfully led various AI projects including RAG Framework, multimodal systems, and recommendation systems.

🤖
AI Agent
📊
Data Pipeline
💬
NLP/Chatbot
👁️
Computer Vision
AI & Data Science Projects

Featured Projects

AI projects that solved real business problems

Data Engineering AI Team Lead

Regional Currency Big Data Platform & Data Pipeline

Problem

Payment and operational data from regional currencies across multiple municipalities were fragmented, making data-driven policy decisions difficult.

Solutions

  • Enterprise Data Lakehouse: Designed unified Data Lake collecting from heterogeneous DBs (Oracle, MySQL) across municipalities
  • ETL Pipeline Automation: Apache Spark-based batch processing system handling millions of daily transactions
  • Data Governance: Implemented RBAC-based access control and pseudonymization for sensitive financial/personal data
Apache Spark Hadoop Python Oracle Airflow

Key Results

Real-time
Analysis time (from days)
Automated
Custom policy reports per municipality
NLP / Chatbot AI Dev Lead & PM

Regional Currency AI Chatbot - 9 Services (Multi-Tenant Architecture)

Problem

Customer inquiries for 9 municipalities' regional currency services surged, requiring CS cost reduction and 24/365 response capability.

Solutions

  • OSMU Engine: Multi-Tenant architecture with single AI core handling 9 different municipal policies
  • Hybrid NLP Model: Rule-based + KoBERT/BERT Intent Classification for maximum accuracy
  • MLOps Pipeline: Continuous model improvement through automatic retraining with conversation history
Python KoBERT TensorFlow FastAPI Redis

Key Results

60%+
Simple inquiry auto-resolution
9
Concurrent services (zero-downtime)
RAG / Enterprise Project Lead

Enterprise AI Chatbot (MS Teams Integration)

Problem

Employees' repetitive questions about internal policies, IT support, and HR needed automated responses. MS Teams integration was essential.

Solutions

  • MS Teams Bot: Instantly usable within Teams without separate app installation
  • RAG Model: Vector-based search and answer generation from internal wiki (Confluence) and policy documents, minimizing hallucination
  • Security Protocol: Hybrid On-Premise + Private Cloud configuration compliant with enterprise security guidelines
MS Bot Framework Azure Cognitive Python Elasticsearch

Key Results

40%
Helpdesk ticket reduction
Automated
New employee onboarding guide
RecSys / ML Data Scientist Lead

Personalized Recommendation Service for Regional Currency

Problem

Beyond simple payment, a customized benefit system was needed to boost local merchant sales and enhance user benefits.

Solutions

  • Hybrid Recommendation: Collaborative Filtering + Content-Based Filtering combination for improved accuracy
  • Real-time Context: "Places to visit now" recommendations combining user location (Geo-fencing), weather, and time
  • User Profiling: Vector-based similar merchant recommendations using activity area, preferred categories, and consumption time
Scikit-learn Matrix Factorization Python PostgreSQL Redis

Key Results

+15%
Recommended merchant visit CTR
Validated
SMB marketing platform value
Computer Vision / OCR AI Tech Lead

Merchant Registration Document Classification & OCR Automation

Problem

Tens of thousands of business registration certificates and bank statements submitted for merchant registration were being manually reviewed and entered, causing inefficiency.

Solutions

  • Image Classification: CNN-based model (EfficientNet) for automatic document type classification
  • OCR Extraction: Clova OCR + Tesseract ensemble for key field (business ID, account number) text extraction
  • Verification: Automatic validity check through National Tax Service and Financial Settlement API integration
  • HITL Interface: UI/UX for human review of only low-confidence AI predictions
PyTorch OpenCV OCR API EfficientNet Django

Key Results

90%
Review time reduction (5min→30sec)
Zero
Human Error eliminated

Tech Stack

Core technologies used in AI/ML projects

AI & Machine Learning

PyTorch TensorFlow Scikit-learn HuggingFace LangChain OpenAI API

NLP & LLM

GPT-4 BERT/KoBERT RAG Vector DB Embedding Prompt Engineering

Data Engineering

Apache Spark Airflow Elasticsearch PostgreSQL Redis Docker
Open to Opportunities

Let's Work Together

I welcome various collaboration opportunities including
AI/ML projects, system architecture design, and team leadership.

© 2026 Seo Jeong-hwa. Built with AI-Assisted Development