Vickey Ghimire

GIS | Software | Urban Resilience

Hello!
    I am Vickey, an incoming M.S. student at the University of Texas at Dallas, studying Geospatial Information Sciences with a focus on disaster resilience and urban planning. I received my B.S. in Computer Science from Northern Kentucky University, graduating Summa Cum Laude with minors in Mathematics & Information Technology.
    My interest in CS was rooted in the power of digital creation. Now, I am channeling that foundation into an applied direction: how can the intersection of spatial data and machine learning help build more resilient communities?
    Outside of academics, I like solving puzzles, playing and listening to music, or getting lost in a good novel or history book. I love figuring things out; whether it's a tricky bug, a plot twist, or a moment in time.

:)

Education

Click entry for details

M.S. in Geospatial Information Sciences

The University of Texas at Dallas

Incoming Fall 2026

Honors: John Forrest Kain Scholarship

B.S. in Computer Science

Northern Kentucky University

August 2022 - May 2026

Minors: Mathematics and Information Technology

GPA: 3.94

Honors: Summa Cum Laude, EDGE AWARD, President's Honors List

Experience

Click entry for details

Full-Stack Django Developer

Department of Physics, Geology & Engineering Technology, NKU

Fall 2025 - Spring 2026

Worked on the web application for SIRES (Smart Integrated Renewable Energy Systems) project.

Key contributions include:

• Engineered a modular, class-based Django architecture refactoring a legacy tool into an MTV-compliant multi-user web platform with secure authentication and SQLite-based data persistence
• Built dynamic data pipelines integrating NASA POWER and OpenStreetMap APIs, replacing static spreadsheet extraction with location-specific environmental inputs.
• Optimized energy system (NGA-III) computation efficiency by ~35% through convergence-based termination and in-session result caching.
• Upgraded the frontend with Chart.js visualizations and refined UI typography, delivering interactive analytics on optimization results.

Undergraduate Research Assistant

College of Informatics, NKU

Fall 2024 - Spring 2026

As a research assistant at NKU's Human-Computer Interaction Lab, I primarily worked on projects around language model behavior and natural language processing.

Advisors: Dr. Nazmus Sadat and Dr. Nicholas Caporusso

Key contributions include:

• Architected a novelty-detection framework using lexical (Jaccard) and semantic (BERT-based) measures to auto-classify LLM outputs, reducing manual review burden from 32.8% to 3.1% across 31,000+ generated topics.
• Led survey-based analysis of ChatGPT's memory feature across 135+ users and 55+ accounts, uncovering critical gaps in user awareness and data privacy practices.
• Co-authored 3 peer-reviewed papers (2 as first author), leading data analytics workflows from preprocessing and statistical modeling to data viz production.

Undergraduate Teaching Assistant

College of Informatics, NKU

Fall 2025

As a Teaching Assistant for CSC 364: Data Structures and Algorithms in Java, I helped facilitate the course by supporting student learning and assisting the instructor as needed.

Course Instructor: Dr. Junxiu Zhou

Key responsibilities:

• Held weekly office hours mentoring 40+ students on DSA concepts, debugging, and Java problem-solving.
• Collaborated with instructor to develop Java test cases, design assessment rubrics, and streamline grading workflows.

Business Services Supervisor

NKU Campus Recreation Center

Fall 2025 - Spring 2026

As a Business Services Supervisor (Student Employment) at Kentucky's largest recreation center, I oversaw membership services operations, supervised student employees, managed sales and membership reports, supported special events and more.

Before transitioning to this role, I also worked as a Facilities Manager (Spring 2023 - Fall 2025) within the same organization where I supervised daily operations, ensured facility safety and efficiency, trained and led student-staff, fostering leadership development and resolving operational challenges.

Publications

Click entry for details

Benchmarking LLMs for Content Expansion: Measuring Novelty in Iterative Course Outline Generation

Ghimire V, Shrestha L.J., Dhungana B, Sadat N, Caporusso N (2025)

Iterative prompting is a technique commonly used to progressively refine the output produced by Large Language Models (LLMs) to obtain more comprehensive content. This method is particularly useful in the context of education for drafting and refining lists of topics to be included in course outlines, syllabi, and lecture plans. However, one of the main challenges in iterative prompting is determining whether, at each iteration, the model outputs truly new material or merely rephrases existing content. While iterative refinement is a common practice, it often results in repetition, which requires educators to conduct time-consuming manual checks, thereby limiting the benefits provided by LLM-assisted course design. To mitigate this, we introduce a framework that applies a novelty metric, integrating both lexical and semantic similarity measures, to automatically categorize generated outputs as novel, redundant, or requiring human judgment. We tested this framework across seven widely used LLMs (ChatGPT o3, Claude 4 Sonnet, Gemini 2.5 Pro, DeepSeek R1, Grok 3, Qwen 2.5, and Llama 3.3) to create a Java programming course outline over multiple iterations. Results from conversations containing over 23,751 individual topics showed that DeepSeek R1 produced the highest percentage of novel content. Moreover, our method successfully automated classification for 96.7% of the outputs, substantially reducing the proportion of items that require manual review.

[RESEARCH GATE]

Evaluating Output Novelty in Iterative Prompting in Educational Content Generation

Ghimire V, Shrestha L.J., Dhungana B, Sadat N, Caporusso N (2025)

Large Language Models (LLMs) have been explored as a valuable tool for creating course outlines. To this end, iterative prompting can be utilized to correct and expand the output. However, iterative prompting often results in LLMs producing redundant content that requires extensive manual review. In this context, distinguishing between genuinely novel content and rephrased existing information creates a trade-off between LLM capabilities and validation efforts. To address this, this paper proposes a novelty metric that combines lexical similarity mea- sures with semantic analysis to automatically classify generated content as novel or repeated, thus simplifying human review. We evaluated this approach using five widely used LLMs (ChatGPT 4o, Claude 3.7 Sonnet, Gemini 2.5 Flash, DeepSeek v3, and DeepSeek r1) to create a Java programming course outline over multiple iterations. Our methodology automated the classification of 90.52% of the generated content, significantly reducing the number of items requiring manual review. Moreover, our findings show that the proposed metric can also be utilized for ranking the performance of models in the context of iterative prompting.

View Poster

Click to open

[RESEARCH GATE]

Analysis of the Content of ChatGPT's Memory: Types of Information, Security Implications, and User Perception

Sadat N, Caporusso N, My Doan, Ghimire V, Dhungana B, Shrestha L.J. (2025)

OpenAI’s new “memory" feature enables ChatGPT to provide more personalized and relevant interactions by storing user information from the prompts and using it across conversations. While offering improved responses, the memory feature poses privacy and security challenges. This paper reports a three-fold study investigating ChatGPT’s memory feature in more detail. First, we utilized the Knowledge-Attitude-Behavior model and distributed a survey to over 135 users to assess their awareness of ChatGPT’s memory functionality, attitudes toward privacy implications, and the behavioral changes prompted by perceived risks. Secondly, memory content from over 55 user accounts was analyzed to evaluate the accuracy, relevance, and privacy of the stored data. Finally, we studied the distribution of the stored data across key categories to obtain insights into what kind of information ChatGPT considers relevant and stores. The findings reveal gaps in user understanding of the memory feature, the need for greater transparency, and the challenges of personalizing LLM agents while safeguarding privacy

View Research Poster

Click to open

[IEEE Explore]

Projects

Click entry for details

Road Risk Viewer

An interactive GIS web application to visualize road hazard and planning risk-aware travel across Nepal. Combines terrain analysis, seasonal rainfall data, and a dual-hazard risk model to score every road segment for landslide and flood susceptibility.

April 2026

Tech stack: GeoPandas, Shapely, Raster & DEM Analysis, OpenStreetMap, React-Leaflet

[GitHub]

Predictive Maintenance Framework

A predictive maintenance analysis framework with production-style simulation pipeline that processes hourly machinery telemetry and event data to predict failures and estimate RUL (Remaining Useful Life) of critical machines.

November 2025

Tech stack: Random Forest models, Pandas, FastAPI, Replit, Numpy, MCP Servers

[GitHub]

Knowtify

A local-first application that transforms your study materials into interactive flashcards and quizzes using LLMs locally - no cloud, no tracking, just effective learning.

August 2025

Tech stack: FastAPI, Ollama, LangChain, SQLAlchemy, DBsqlite

[GitHub]

RouteRishi

A smart travel planning assistant to help you budget, plan & create curated itinerary for your next trip with real-time flights & hotels info, weather based acitivites, and more.

July 2025

Tech stack: FastAPI, LangChain, Firebase, Firestore, Google Gen-AI

[GitHub]

Stock Insights

An Android application that empowers users to track, analyze, and forecast stock market trends, integrating live financial data with ML based predictions.

May 2025

Tech stack: Django, Websocket, Kotlin,SQLite, Facebook Prophet

[GitHub]

more at github.com/junggeyy