BEFORE · Current (Arial)
Girishkumar Ponkiya
Data Scientist (Vice President) | Applied AI & NLP
Mumbai, India · girish.isical@gmail.com · +91-9004009268
girishp.in · linkedin.com/in/girishponkiya · github.com/girishponkiya · Google Scholar
Professional Summary
Applied AI specialist and NLP practitioner with a Ph.D. from IIT Bombay and 6+ years building production AI systems in the financial industry. Builds LLM infrastructure at ISS STOXX — covering deployment, observability, and optimization — and serves as the organisation's go-to resource for NLP/LLM understanding and adoption, including a company-wide workshop series for software development teams. Published in ACL, EMNLP, and COLING; brings research-grade rigour to practical, scalable AI solutions.
Technical Skills
Applied NLP / LLMs: RAG pipelines, NER (spaCy fine-tuning), entity linking, knowledge graphs, text classification, word sense disambiguation
LLM Infrastructure: vLLM (deployment & monitoring), Telegraf, InfluxDB 2.0, Grafana, xGrammer, Kubernetes, Apache NiFi, Vespa
ML / DL Frameworks: PyTorch, TensorFlow, Keras, scikit-learn, Hugging Face Transformers (BERT, RoBERTa, T5)
Languages & Data: Python, C/C++, Java, SQL; Apache Iceberg, JupyterLab, Pandas, Flask
Professional Experience
Data Scientist (Vice President)
|
ISS STOXX
| Mar 2021 – Present
Mumbai, India | Individual contributor; cross-functional AI advisor. AVP → VP Jan 2026.
- Deployed and maintain multiple LLMs on internal GPU infrastructure using vLLM (Gemma3:27B), applying FP8 quantization, kv-cache quantization, multi-GPU tensor parallelism; diagnosed CPU-level AVX2/AVX-512 bottleneck that led to a server hardware upgrade. Manager described this as a "break-through moment for my team and the company."
- Designed and currently delivering company-wide "Introduction to LLM for Software Developers" workshop series (8 sessions planned; 3 delivered as of Mar 2026) using a flipped-classroom format. ~70 participants per live session; additional reach via recordings.
- Grew the ESG news pipeline's business relevance rate from ~2% to ~35% — analysts now need ~3 reads per relevant article, down from ~50. Led a Japanese-language expansion that raised business relevance for Japanese content from ~9% to ~33%. The pipeline handles ~700K articles/day across 10+ languages.
- Built and open-sourced vLLM Metrics Dashboard — a Telegraf → InfluxDB 2.0 → Grafana stack monitoring TTFT, throughput, KV-cache utilisation, HTTP responses, and Python GC across 3 vLLM inference instances.
- Built an AI-driven document processing pipeline for thematic index creation (PoC): PDF → Markdown → LLM summarisation. Business stakeholders validated the output positively.
→
AFTER · Recommended Hybrid (Inter + Libre Baskerville name)
Girishkumar Ponkiya
Principal-Level Data Scientist · LLM Infrastructure & Production NLP
Mumbai, India · girish.isical@gmail.com · girishp.in · linkedin.com/in/girishponkiya · github.com/girishponkiya
700K articles/day · 2% → 35% relevance lift · ~70 engineers/session · ACL · EMNLP · COLING
Technical Skills
Applied NLP / LLMs
RAG pipelines, NER (spaCy fine-tuning), entity linking, knowledge graphs
LLM Infrastructure
vLLM, Telegraf, InfluxDB 2.0, Grafana, Kubernetes, NiFi, Vespa
ML / DL Frameworks
PyTorch, Hugging Face Transformers, scikit-learn, TensorFlow
Languages & Data
Python, SQL, C/C++, Java · Iceberg, Pandas, Flask
Professional Experience
ISS STOXX · Data Scientist (Vice President)
Mar 2021 – Present
Technology Innovation Lab · Mumbai · IC; cross-functional AI advisor. Promoted Jan 2026.
- AVX2/AVX-512 CPU bottleneck diagnosed → hardware upgrade; deployed vLLM (Gemma3:27B) with FP8 quantization + tensor parallelism. Manager: "break-through moment for my team and the company."
- ~70 engineers upskilled/session via company-wide LLM series — RAG, fine-tuning, agents (8 planned; flipped-classroom).
- Relevance: 2% → 35% on 700K-article/day ESG pipeline (10+ languages); Japanese expansion 9% → 33%.
- Open-sourced vLLM Metrics Dashboard (Telegraf → InfluxDB 2.0 → Grafana). AI document pipeline (PoC, business validated).
Selected Publications
5 peer-reviewed publications · ACL-IJCNLP 2021, EMNLP 2020, COLING 2018, LREC 2018, ICON 2016
~73 citations · h-index 5 (Google Scholar, Apr 2026)