Dipak Meher
PhD Student · GMU

PhD Student in Computer Science
George Mason University

PhD student researching the intersection of machine learning, knowledge graphs, NLP, large language models (LLMs), and graph-based retrieval-augmented generation (RAG). Experienced in building end-to-end ML pipelines with PyTorch, TensorFlow, and Hugging Face, with 2 years of prior industry experience. Publications at KDD, ICDM, ICKG, SDM, and IJCNN, focusing on robust knowledge graph construction and cross-domain recommendation. Seeking Summer 2026 internships.

Latest News

  • Feb 2026
    Served as Reviewer — ACM Transactions on Intelligent Systems and Technology (TIST)
  • Nov 2025
    Inside CORE-KG accepted at ICDM 2025 (IEEE). Congratulations Prof. Domeniconi.
  • Nov 2025
    LINK-KG accepted at ICKG 2025 (IEEE). Congratulations Prof. Domeniconi and Prof. Correa-Cabrera.
  • Aug 2025
    CORE-KG accepted at KDD 2025 Workshop (SKnow-LLM). Congratulations Prof. Domeniconi and Prof. Correa-Cabrera.
  • Nov 2024
    Invited talk: GraphRAG for Query-Focused Summarization (GMU)
  • Oct 2024
    Our paper "Understanding User Behavior in Cross-Domain Recommendation: An LLM-Based Approach" has been accepted at IJCNN 2025. Congratulations Prof. Rosenblum and Ajay.
  • Sept 2024
    Served as Reviewer — IEEE International Joint Conference on Neural Networks (IJCNN) 2025
  • Sept 2024
    Invited talk: Mitigating Temporal Degradation in Entity Linking (GMU)
  • Aug 2024
    Appointed President, Bhakti Yoga Club, George Mason University
  • Oct 2024
    Our paper "Vietoris-Rips Complex: A New Direction for Cross-Domain Cold-Start Recommendation" has been accepted at SDM 2024. Congratulations Ajay, Prof. Rosenblum, Prof. Zhu and Shrunal.

Research Experience

Graduate Research Assistant, George Mason University

Advisor: Prof. Carlotta Domeniconi

Aug 2024 – Aug 2025 · Fairfax, VA

LINK-KG

Knowledge graph construction (short and long documents)
  • Designed an LLM-based pipeline for knowledge graph construction from complex legal narratives, addressing long-range coreference, entity disambiguation, and reference normalization.
  • Developed a three-stage coreference system using a type-specific prompt cache to resolve plural mentions, role shifts, and alias ambiguity, mitigating context-window limitations and lost-in-the-middle issues in LLMs.
  • Achieved a 45.21% reduction in node duplication and a 32.22% reduction in legal noise compared with GraphRAG and CORE-KG.

CORE-KG

Human smuggling network analysis
  • Developed a modular LLM-driven framework integrating type-aware coreference resolution and domain-specific prompting to construct coherent knowledge graphs from human smuggling legal cases.
  • Introduced sequential entity extraction and legal-specific filtering to reduce attention drift, suppress procedural noise, and improve entity-type accuracy in graph construction.
  • Achieved a 33.28% reduction in node duplication and a 38.37% reduction in legal noise compared with GraphRAG.
LLMs · Knowledge Graphs · Coreference · GraphRAG · Legal NLP

Graduate Research Assistant, George Mason University

Advisor: Prof. David Rosenblum

May 2023 – Aug 2024 · Fairfax, VA

Quantifying Cross-Domain User Behavior Consistency with LLMs

  • Proposed the first LLM-based framework to quantify user behavior consistency across domains via item feature extraction, sentiment analysis, and behavior alignment.
  • Conducted an empirical study across 12 domain pairs and 4 state-of-the-art CDR models, showing current methods underutilize behavior consistency.
  • Observed strong consistency in similar domains (Books–Movies: 0.87) and lower consistency in dissimilar domains (Electronics–Food: 0.23).

LLMs for Transfer Learning in Recommender Systems

  • Co-authored an LLM-driven framework for cross-domain recommendation, improving personalization in sparse-data cold-start and warm-start scenarios.
  • Benchmarked six baselines (TGT, CMF, EMCDR, PTUPCDR, DisenCDR, UniCDR) and evaluated prompt strategies with and without target-domain examples.

Vietoris-Rips Complex for Cross-Domain Cold-Start Recommendation

  • Implemented and benchmarked baseline models (TGT, CMF, EMCDR, PTUPCDR) for cross-domain recommendation (SDM 2024).
  • Developed VRCDR using the Vietoris-Rips complex and Area-based Triangulated Embedding (ATE) to model users’ niche source-domain preferences as geometric structures.
  • Achieved up to 20% improvement over state-of-the-art methods under extreme cold-start conditions.
Recommender Systems · Transfer Learning · LLM Prompting · Cross-domain

Research Intern, Indian Institute of Technology, Kharagpur

Advisor: Prof. Pawan Goyal

May 2019 – Jun 2019 · Kharagpur, India
  • Developed an apparel classifier using deep learning on the 800K-image DeepFashion dataset.
  • Improved model robustness via data augmentation and hyperparameter tuning, achieving 89% accuracy.
Deep Learning · Computer Vision · DeepFashion

Publications

Published
Conference Paper

Dipak Meher, C. Domeniconi, G. Correa-Cabrera

“LINK-KG: LLM-Driven Coreference-Resolved Knowledge Graphs for Human Smuggling Networks”

ICKG 2025 (IEEE)
Published
PhD Forum

Dipak Meher, C. Domeniconi

“Inside CORE-KG: Evaluating Structured Prompting and Coreference Resolution for Knowledge Graphs”

ICDM 2025 (IEEE) · PhD Forum
Published
Workshop Paper

Dipak Meher, C. Domeniconi, G. Correa-Cabrera

“CORE-KG: An LLM-Driven Knowledge Graph Construction Framework for Human Smuggling Networks”

KDD 2025 (ACM) · SKnow-LLM Workshop
Published
Conference Paper

Dipak Meher, A. Krishna Vajjala, D. Rosenblum

“Understanding User Behavior in Cross-Domain Recommendation: An LLM-Based Approach”

IJCNN 2025 (IEEE)
In Review
Conference Paper

A. Krishna Vajjala, Dipak Meher, D. Rosenblum

“Cross-Domain Recommendation Meets Large Language Models”

In Review
Published
Conference Paper

A. Krishna Vajjala, Dipak Meher, S. Pothagoni, Z. Zhu, D. Rosenblum

“Vietoris-Rips Complex: A New Direction for Cross-Domain Cold-Start Recommendation”

SDM 2024 (SIAM)

Work Experience

Systems Engineer, Tata Consultancy Services (TCS)

Nov 2020 – Jul 2022 · Mumbai, India
  • Worked on a PowerApps project for a US-based client, developing UI screens, custom controls, and critical functionalities.
  • Overcame limitations of built-in PowerApps controls by building custom controls in TypeScript with the PowerApps Component Framework (PCF).
  • Analyzed client data and key parameters (including data types) to develop compatible custom solutions.
  • Led a team of 4 to deliver the product’s critical features on time.
  • Executed a POC in PowerApps using an AI Text Recognition model to automate form filling.
  • Followed Microsoft-recommended coding standards for components and PCF controls; reduced execution time and improved application performance by 10%.
  • Followed the Scaled Agile Framework (SAFe) and used Azure DevOps to track sprint progress.
PowerApps · PCF · TypeScript · Power Automate · Azure DevOps · SAFe · Agile

Academic Projects

Selected coursework projects across NLP and Data Mining

LLM Post-Training for Multilingual Reasoning

NLP · Multilingual · Post-training
Aug 2025 – Dec 2025
  • Studied multilingual reasoning failures caused by implicit translation drift across five languages on the BBQ benchmark.
  • Built a translation-aided post-training framework using SFT and GRPO on synthetic translated data without labels.
  • Improved compute-normalized multilingual accuracy and analyzed translation fidelity using COMET.

Multilingual Text Classification using BERT

NLP · BERT
Sept 2023 – Oct 2023
  • Fine-tuned BERT for multilingual text classification into 15 categories with 85.1% test accuracy.
  • Managed data collection, augmentation, and model training/testing on a 12K multilingual dataset.
  • Achieved 94% accuracy on cross-lingual tests, supporting multilingual applications.

Sentiment Analysis using Logistic Regression

Data Mining · Classical ML
Jan 2023 – Feb 2023
  • Performed sentiment analysis on 20K product reviews with preprocessing, stemming, and stopword removal.
  • Evaluated bag-of-words, n-gram, and TF-IDF methods; achieved 87% accuracy using bi-gram bag-of-words.
  • Developed and trained the model in Python using NLTK, Scikit-learn, and Pandas for end-to-end implementation.

Education

PhD in Computer Science

George Mason University, Fairfax, VA

Aug 2024 – Present

Research focus on machine learning, knowledge graphs, large language models, and graph-based retrieval-augmented generation (RAG).

MS in Computer Science

George Mason University, Fairfax, VA

Aug 2022 – May 2024
GPA: 3.96 / 4.0 · Distinguished Academic Award

BTech in Computer Science and Engineering

SGGSIET, Nanded, India

Aug 2016 – Nov 2020
GPA: 8.56 / 10.0

Technical Skills

Programming languages, frameworks, and tools

Programming & Frameworks

Python · Java · C++ · C · SQL · Go · React · Node.js · Angular · Spring Boot

ML, NLP & Systems Tools

PyTorch · TensorFlow · Scikit-learn · Hugging Face · LangChain · spaCy · Ollama · AWS S3 · Docker

Academic & Community Engagement

Talks, service, leadership, and community contributions

Invited Talks

Selected invited seminars and workshop talks

Highlights
  • Mitigating Temporal Degradation in Entity Linking — Data Mining Lab, GMU · 2024
  • GraphRAG for Query-Focused Summarization — Data Mining Seminar, GMU · 2024
  • Understanding User Behavior in Cross-Domain Recommendation — IJCNN 2025, Rome, Italy
  • Knowledge Graph (KG) Construction with LLMs — KDD 2025 Workshop (SKnow-LLM), Toronto, Canada
  • LINK-KG: Coreference-Resolved KGs for Human Smuggling Networks — ICKG 2025, Limassol, Cyprus
Seminars · Workshops · Conferences

Mentorship

  • Elijah Feldman, Thomas Jefferson High School for Science and Technology · Research Mentorship

Academic Service

  • Reviewer — IEEE International Joint Conference on Neural Networks (IJCNN), 2025
  • Reviewer — ACM Transactions on Intelligent Systems and Technology (TIST), 2026

Leadership & Volunteering

Student leadership, community service, and technical volunteering

President, Bhakti Yoga Club

George Mason University

Jan 2023 – Present
  • Managed official university requirements, including approvals, room bookings, and event coordination.
  • Organized and led weekly sessions on yoga, meditation, mindfulness, and vegetarian lifestyle with a team of 5.
  • Oversaw logistics and coordinated food distribution for 30 students at weekly events.

Core Manager, VOICE Club

Shri Guru Gobind Singhji Institute of Engineering and Technology, Nanded

Aug 2016 – Aug 2017
  • Organized student events on mind control, stress management, and self-development.
  • Led a team of 5 members, handling responsibilities including crowd management and logistics.
  • Mentored 10 students through the “Discover Yourself” course inspired by the Bhagavad Gita.

Web Developer, PRAGYAA 2016

Shri Guru Gobind Singhji Institute of Engineering and Technology, Nanded

Aug 2016 – Mar 2017
  • Member of the web design team for a national-level technical fest.
  • Designed and developed the event website and Android application with a team of 10 students.
  • Maintained the website and application during peak event hours.

Contact

Feel free to reach out for research collaboration, internships, or academic discussions.