Vickey Ghimire

GIS | Software | Urban Resilience

Hello!
    I am Vickey, an incoming M.S. student at the University of Texas at Dallas, studying Geospatial Information Sciences with a focus on disaster resilience and urban planning. I received my B.S. in Computer Science from Northern Kentucky University, graduating Summa Cum Laude with minors in Mathematics & Information Technology.
    My interest in CS was rooted in the power of digital creation. Now, I am channeling that foundation into an applied direction: how can the intersection of spatial data and machine learning help build more resilient communities?
    Outside of academics, I like solving puzzles, playing and listening to music, or getting lost in a good novel or history book. I love figuring things out; whether it's a tricky bug, a plot twist, or a moment in time.

:)

Education

Click entry for details
M.S. in Geospatial Information Sciences
The University of Texas at Dallas
Honors: John Forrest Kain Scholarship
B.S. in Computer Science
Northern Kentucky University
Minors: Mathematics and Information Technology
GPA: 3.94
Honors: Summa Cum Laude, EDGE AWARD, President's Honors List

Experience

Click entry for details
Full-Stack Django Developer
Department of Physics, Geology & Engineering Technology, NKU
Worked on the web application for SIRES (Smart Integrated Renewable Energy Systems) project.

Key contributions include:

• Engineered a modular, class-based Django architecture refactoring a legacy tool into an MTV-compliant multi-user web platform with secure authentication and SQLite-based data persistence
• Built dynamic data pipelines integrating NASA POWER and OpenStreetMap APIs, replacing static spreadsheet extraction with location-specific environmental inputs.
• Optimized energy system (NGA-III) computation efficiency by ~35% through convergence-based termination and in-session result caching.
• Upgraded the frontend with Chart.js visualizations and refined UI typography, delivering interactive analytics on optimization results.
Undergraduate Research Assistant
College of Informatics, NKU
As a research assistant at NKU's Human-Computer Interaction Lab, I primarily worked on projects around language model behavior and natural language processing.
Key contributions include:
• Architected a novelty-detection framework using lexical (Jaccard) and semantic (BERT-based) measures to auto-classify LLM outputs, reducing manual review burden from 32.8% to 3.1% across 31,000+ generated topics.
• Led survey-based analysis of ChatGPT's memory feature across 135+ users and 55+ accounts, uncovering critical gaps in user awareness and data privacy practices.
• Co-authored 3 peer-reviewed papers (2 as first author), leading data analytics workflows from preprocessing and statistical modeling to data viz production.
Undergraduate Teaching Assistant
College of Informatics, NKU
As a Teaching Assistant for CSC 364: Data Structures and Algorithms in Java, I helped facilitate the course by supporting student learning and assisting the instructor as needed.
Course Instructor: Dr. Junxiu Zhou
Key responsibilities:
• Held weekly office hours mentoring 40+ students on DSA concepts, debugging, and Java problem-solving.
• Collaborated with instructor to develop Java test cases, design assessment rubrics, and streamline grading workflows.
Business Services Supervisor
NKU Campus Recreation Center
As a Business Services Supervisor (Student Employment) at Kentucky's largest recreation center, I oversaw membership services operations, supervised student employees, managed sales and membership reports, supported special events and more.
Before transitioning to this role, I also worked as a Facilities Manager (Spring 2023 - Fall 2025) within the same organization where I supervised daily operations, ensured facility safety and efficiency, trained and led student-staff, fostering leadership development and resolving operational challenges.

Publications

Click entry for details
Benchmarking LLMs for Content Expansion: Measuring Novelty in Iterative Course Outline Generation
Ghimire V, Shrestha L.J., Dhungana B, Sadat N, Caporusso N (2025)


Iterative prompting is a technique commonly used to progressively refine the output produced by Large Language Models (LLMs) to obtain more comprehensive content. This method is particularly useful in the context of education for drafting and refining lists of topics to be included in course outlines, syllabi, and lecture plans. However, one of the main challenges in iterative prompting is determining whether, at each iteration, the model outputs truly new material or merely rephrases existing content. While iterative refinement is a common practice, it often results in repetition, which requires educators to conduct time-consuming manual checks, thereby limiting the benefits provided by LLM-assisted course design. To mitigate this, we introduce a framework that applies a novelty metric, integrating both lexical and semantic similarity measures, to automatically categorize generated outputs as novel, redundant, or requiring human judgment. We tested this framework across seven widely used LLMs (ChatGPT o3, Claude 4 Sonnet, Gemini 2.5 Pro, DeepSeek R1, Grok 3, Qwen 2.5, and Llama 3.3) to create a Java programming course outline over multiple iterations. Results from conversations containing over 23,751 individual topics showed that DeepSeek R1 produced the highest percentage of novel content. Moreover, our method successfully automated classification for 96.7% of the outputs, substantially reducing the proportion of items that require manual review.

Evaluating Output Novelty in Iterative Prompting in Educational Content Generation
Ghimire V, Shrestha L.J., Dhungana B, Sadat N, Caporusso N (2025)


Large Language Models (LLMs) have been explored as a valuable tool for creating course outlines. To this end, iterative prompting can be utilized to correct and expand the output. However, iterative prompting often results in LLMs producing redundant content that requires extensive manual review. In this context, distinguishing between genuinely novel content and rephrased existing information creates a trade-off between LLM capabilities and validation efforts. To address this, this paper proposes a novelty metric that combines lexical similarity mea- sures with semantic analysis to automatically classify generated content as novel or repeated, thus simplifying human review. We evaluated this approach using five widely used LLMs (ChatGPT 4o, Claude 3.7 Sonnet, Gemini 2.5 Flash, DeepSeek v3, and DeepSeek r1) to create a Java programming course outline over multiple iterations. Our methodology automated the classification of 90.52% of the generated content, significantly reducing the number of items requiring manual review. Moreover, our findings show that the proposed metric can also be utilized for ranking the performance of models in the context of iterative prompting.

Analysis of the Content of ChatGPT's Memory: Types of Information, Security Implications, and User Perception
Sadat N, Caporusso N, My Doan, Ghimire V, Dhungana B, Shrestha L.J. (2025)


OpenAI’s new “memory" feature enables ChatGPT to provide more personalized and relevant interactions by storing user information from the prompts and using it across conversations. While offering improved responses, the memory feature poses privacy and security challenges. This paper reports a three-fold study investigating ChatGPT’s memory feature in more detail. First, we utilized the Knowledge-Attitude-Behavior model and distributed a survey to over 135 users to assess their awareness of ChatGPT’s memory functionality, attitudes toward privacy implications, and the behavioral changes prompted by perceived risks. Secondly, memory content from over 55 user accounts was analyzed to evaluate the accuracy, relevance, and privacy of the stored data. Finally, we studied the distribution of the stored data across key categories to obtain insights into what kind of information ChatGPT considers relevant and stores. The findings reveal gaps in user understanding of the memory feature, the need for greater transparency, and the challenges of personalizing LLM agents while safeguarding privacy

Projects

Click entry for details
Road Risk Viewer
An interactive GIS web application to visualize road hazard and planning risk-aware travel across Nepal. Combines terrain analysis, seasonal rainfall data, and a dual-hazard risk model to score every road segment for landslide and flood susceptibility.
Tech stack: GeoPandas, Shapely, Raster & DEM Analysis, OpenStreetMap, React-Leaflet
Predictive Maintenance Framework
A predictive maintenance analysis framework with production-style simulation pipeline that processes hourly machinery telemetry and event data to predict failures and estimate RUL (Remaining Useful Life) of critical machines.
Tech stack: Random Forest models, Pandas, FastAPI, Replit, Numpy, MCP Servers
Knowtify
A local-first application that transforms your study materials into interactive flashcards and quizzes using LLMs locally - no cloud, no tracking, just effective learning.
Tech stack: FastAPI, Ollama, LangChain, SQLAlchemy, DBsqlite
RouteRishi
A smart travel planning assistant to help you budget, plan & create curated itinerary for your next trip with real-time flights & hotels info, weather based acitivites, and more.
Tech stack: FastAPI, LangChain, Firebase, Firestore, Google Gen-AI
Stock Insights
An Android application that empowers users to track, analyze, and forecast stock market trends, integrating live financial data with ML based predictions.
Tech stack: Django, Websocket, Kotlin,SQLite, Facebook Prophet

more at github.com/junggeyy

CV

Click entry for details
Curriculum Vitae
View or download my CV
Resume
View or download my Resume