Samuel Maley

Greater Orlando, FL · (352) 630-0171 · samuel.e.maley@gmail.com

Fall 2024 Computer Science graduate from the University of Florida with interest and experience in Software Engineering, Data Science, and Machine/Deep Learning. Currently seeking full-time opportunities. Feel free to reach out!

Download PDF Resume

Experience

Machine Learning Research Intern

Applied Research Laboratory, Pennsylvania State University
  • Investigated the effects of data augmentation on satellite imagery as a distinct data modality in object detection
  • Utilized PyTorch, CUDA, Meta's Detectron2, and MMDetection to create data augmentation and model training pipelines for overhead image chips to compare each augmentation's effect on accuracy
  • Trained several deep learning model architectures for object detection, including: Shifted Window (Swin) transformer, 101-layer Residual Network (ResNet101), and YOLO
  • Performed dataset preprocessing to generate COCO style annotations for over 180,000 object instances across 2,800 images
  • Automated satellite imagery chipping to create 256x256 pixel image chips from iSAID dataset
May 2024 – Aug 2024

Software Engineering Intern

Dr. Xinde James Ji, University of Florida
  • Under Dr. Xinde James Ji, collaborated in a team of 4 to develope a framework using LangChain that utilizes Large Language Models (LLMs) and Retrieval Augmented Generation (RAG) to analyze and query large, legal documents
  • Implemented scalable, future-proof backend to handle the usage of multiple LLM APIs, including support for 9 different state-of-the-art models
  • Automated testing and analysis engine for performing accuracy checks on outputted results
  • Deployed Chroma vector database for efficient embedding retrieval for RAG
Jan 2024 – Present

Data Science Research Assistant

Data Science Research Lab, University of Florida
  • Researched various topics in video quesion-answering as part of the DARPA ECOLE project under Dr. Daisy Wang
  • Contributed to DARPA's Environment-driven Conceptual Learning (ECOLE) by researching potential improvements to a few-shot classification pipeline for complex video actions
  • Implemented Meta's Segment Anything Model (SAM) and Grouding-DINO to isolate and mask objects in videos from a single textual input
  • Deployed state-of-the-art computer vision models to analyze object affordances in videos, enabling the prediction of potential action locations
  • Created bash scripts utilizing FFMPEG to generate image and video test cases from the STAR dataset
Jan 2024 – Present

Undergraduate Research Assistant

Human Systems Engineering Lab, University of Florida
  • Conducted comprehensive analysis of human-robot interactions to identify potential risks and design safeguards, collaborating closely with cross-functional teams to research emerging technologies
  • Assisted in the design and use of EMG and IMU sensors to collect human participant data over 60+ hours of experiment data
  • Utilized pandas and numpy to prepare collected EMG and movement data for statistical inference. Assisted in the design of a VR human-robot interaction data pipeline through the use of ROSBridge, Unity, and Nvidia Omniverse
Aug 2023 – May 2024

Programming Tutor & Marketing Assistant

355Code
  • Provided personalized computer science instruction to students aged 7-16
  • Managed communications between parents
  • Assist with marketing and collaborate with manager to engineer effective advertisement strategies
  • Conducted cold calls and emails to expand business
Sept 2022 – Dec 2023

Education

University of Florida

Bachelor of Science in Computer Science
Major GPA: 3.51
Relevant Coursework: Machine Learning, Data Science, Software Engineering, Compilers, Senior Design, Operating Systems, Databases, Data Structures & Algorithms, Discrete Mathematics
Dec 2024

Projects

Pokémon Generation

Deep learning for generating unique Pokémon

PokéGAN is an ongoing project that aims to create AI-generated, 256x256 Pokémon sprites. This project includes: sprite generation, image-based type prediction, and name generation. The full gallery of 900 unique, fake Pokémon can be viewed below. Details of implementation can be found at the Github Link.

Final PNG Result

Machine Learning on Tic-Tac-Toe Dataset

Analysis of ML models for Tic-Tac-Toe

This repository contains the code for a machine learning project focused on building and evaluating various models on a tic-tac-toe dataset. The project involves implementing classifiers and regressors using libraries such as pandas, numpy, and scikit-learn, and demonstrates how these models can be used to predict optimal moves in a tic-tac-toe game.

GitHub Repository

Adversarial Attacks on Multimodal Entity Linking (MEL)

Analyis of the effects of data poisoning on state-of-the-art methods in MEL

Multimodal Entity Linking (MEL) is a task aimed to match entities with the corrent types of data across different formats, such as text, images, and videos. The goal is to connect mentions of these entities to the correct entries in a structured knowledge base. However, data poisoning attacks, where attackers deliberately inject misleading or malicious data into the training set, can significantly compromise the performance of these systems. This project investigates the effects of these attacks on the state-of-the-art methods.

View Research Paper

Final PNG Result

MediGator: AI Healthcare Assistant

AI healthcare chatbot and healthcare professional locator

Developed a full-stack Flask/React healthcare application featuring a Firebase-integrated database, Clerk-powered user authentication, and an LLM-driven chatbot. Led a team of 4 as Scrum Master, implemented database methods, designed React components, and integrated Google Maps API for a provider locator tool.

GitHub Repository

YouTube Video Summarizer

ChatGPT-powered video summarization

This project utilizes OpenAI's ChatGPT API and a Youtube transcript API to generate summaries of an inputted YouTube video.

GitHub Repository

AI-in-Classics: Ancient Latin/Greek Temporal Sentiment Analysis Tool

Sentiment Analysis Tool for ancient texts

Collaborated on a Flask/React application to analyze the sentiment of Latin and Greek words across ancient texts. Automated date estimation for 60,000 text segments using Wikipedia's API and LLM-powered scraping, while developing Flask methods and React components for seamless input and display.

GitHub Repository

Skills

Technical Skills
  • Languages: Python, C++, SQL, Java, JavaScript, HTML/CSS, ECL
  • Developer Tools: Git, Linux, Jupyter, Docker, Agile, Scrum, Jira
  • Libraries/Technologies: Scikit-Learn, Keras/TensorFlow, PyTorch, Torchvision, PIL, OpenCV, Pandas, NumPy, Matplotlib, OpenAIAPI, LangChain, SLURM, FFMPEG, React, Vite, NodeJS, Flask, Firebase