Michal Shlapentokh-Rothman
I am a 5th year computer science PhD student at the University of Illinois at Urbana-Champaign. I am co-advised by Professors Derek Hoiem and Yuxiong Wang. Previously, I received my bachelor's and Masters of Engineering degrees in computer science from MIT in 2019 and 2020. At MIT, I was a member of the ALFA Group .
I am looking for research internships for summer 2025.
Email / Google Scholar / LinkedIn / CV
|
|
Research
My overall research goal is to create adaptable and efficient visual agent systems, where foundation models can be composed and trained for a wide variety of downstream tasks with limited user input. During my undergrad, I did research on using evolutionary algorithms for cyber security.
|
|
Can We Generate Visual Programs Without Prompting LLMs?
Michal Shlapentokh-Rothman, Yu-Xiong Wang, Derek Hoiem
In Submission
We develop a synthetic data augmentation approach and alternative program generation method based on decoupling programs into higher-level skills called templates and the corresponding arguments. Our results show that with data augmentation, prompt-free smaller LLMs (approx. 1B parameters) are competitive with GPT-4o-mini on VQA tasks with the added benefit of much faster inference.
|
|
Region Representations Revisited
Michal Shlapentokh-Rothman* , Ansel Blume*, Yao Xiao, Yuqun Wu, Sethurame TV, Heyi Tao, Jae Yong Lee, Wilfredo Torres, Yu-Xiong Wang, Derek Hoiem
CVPR 2024
Project Page/
Arxiv/
Video
Region representations used to be popular in the pre-deep learning era. What happens when we create region representations with recently released founation models? We show that our simple method achieves impressive performance on existing tasks such as semantic segmentation as well as new one.
|
|
Language Agent Tree Search Unifies Reasoning, Acting and Planning in Language Models
Andy Zhou, Kai Yan, Michal Shlapentokh-Rothman, Haohan Wang, Yu-Xiong Wang
ICML 2024
Combines LLM techinques for differnet reasoning and planning benchmarks.
|
|
WebWISE: Web Interface Control and Sequential Exploration with Large Language Models
Heyi Tao*, Sethuramen TV*, Michal Shlapentokh-Rothman, Derek Hoiem
NAACL 2024
How can we use GPT-3 for web-based tasks? We investigate the performance of GPT-3 on the Mini-WoB dataset uisng Document Object Model (DOM) elements as part of the input.
|
|
Learning Curves for Analysis of Deep Networks
Derek Hoiem, Tanmay Gupta, Zhizhong Li, Michal M. Shlapentokh-Rothman
ICML 2021
In this work, we use learning curves to investigate the effects of various design choices on neural network performance. We propose a method for estimating learning curves, abstract their parameters into error and data-reliance, and evaluate the effectiveness of different parameterizations.
|
|
Coevolutionary Modeling of Cyber Attack Patterns and Mitigations Using Public Datasets
Michal Shlapentokh-Rothman, Jonathan Kelly, Avital Baral, Erik Hemberg, Una-May O’Reilly
GECCO 2021
In this work, we use co-evolutionary algorithms to explore the dynamics between cyber attack patterns and potential mitigations.
|
|
Securing the software defined perimeter with evolutionary co-optimizations
Michal Shlapentokh-Rothman, Erik Hemberg, Una-May O'Reilly
GECCO Workshop on Genetic and Evolutionary Computation in Defense, Security, and Risk Management
In this work, we show how we can use a competitive co-evolutionary framework to evaluate different software defined perimeters (SDP).
|
Teaching
Artificial Intelligence (CS 440): Fall 2020
Computational Photography (CS 445): Spring 2021, Spring 2023
|
|