Tongtong Liu (Frank) WFU

Education Professional Experiences Awards Research Courses

Tongtong (Frank) Liu

I am currently a master's student at University of Pennsylvania, School of Engineering and Applied Science, studying Computer and Information Science. I graduated from Wake Forest University in 2023 with a B.S. in Computer Science with honors and B.S. in Mathematical Business. I was previously an undergraduate researcher advised by Dr. Sarra Alqahtani and worked as a research intern at IBM Research mentored by Dr. Mu Qiao in the summer of 2022.

I am the head Teaching Assistant for UPenn's graduate level Machine Learning class. My research interests lie in Large Language Models, Generative Models, robustness and security of Reinforcement Learning, explainable Reinforcement Learning (XRL), and Machine Learning Systems & Infrastructures.

Email / LinkedIn / GitHub / Google Scholar

Education

	University of Pennsylvania M.S.E in Computer and Information Science 2023.08 - 2025.05
	Wake Forest University B.S in Computer Science (with honors) [Honor Thesis] B.S. in Mathematical Business 2019.08 - 2023.05

Professional Experiences

	Meta Menlo Park, CA 2025.06 - Present Incoming Software Engineer
	ByteDance San Jose, CA 2024.05 - 2024.08 Software Engineer Intern
	IBM San Jose, CA 2022.05 - 2022.08 Research Intern

Awards

The John W. Sawyer Prize in Computer Science 2023 (for the top senior graduates in CS)
Academic Excellence in Math Business Award 2023
Finalist for the Computing Research Association's (CRA) Outstanding Undergraduate Researcher Award for 2023
Burke M. McConnell Management Excellence Scholarship, 2022
Wake Forest Research Fellowship, 2021

Research

My general research interests lie in different topics of Machine Learning, including data mining, foundation model based chatbot system in NLP, and security, safety, and robustness of Reinforcement Learning.

	Adversarial Behavior Exclusion for Safe Reinforcement Learning Md Asifur Rahman, Tongtong Liu , Sarra Alqahtani IJCAI, 2023 [paper] Adversarial Behavior Exclusion for Safe RL (AdvEx-RL) learns a behavioral representation of the agent’s safety violations by approximating an optimal adversary utilizing exploration and later uses this representation to learn a separate safety policy that excludes those unsafe behaviors.
	A Policy-Graph Approach to Explain Reinforcement Learning Agents: A Novel Policy-Graph Approach with Natural Language and Counterfactual Abstractions for Explaining Reinforcement Learning Agents Tongtong Liu , Joe McCalmon, Thai Le, Dongwon Lee, Sarra Alqahtani AAMAS Journal, 2023 [paper] In this work, we proposea novel approach that summarizes an agent’s policy in the form of a directed graph with natural language descriptions that help end user to understand the logic behind agent's decision. This work is submitted to JAAMAS.
	Weaponizing Actions in Multi-Agent Reinforcement Learning: Theoretical and Empirical Study on Security and Robustness Tongtong Liu, Joe McCalmon, Md Asifur Rahman, Cameron Lischke, Talal Halabi, Sarra Alqahtani PRIMA, 2022 [paper] [code] This paper investigates the robustness of c-MARL to a novel adversarial threat, where we target and weaponize one agent, termed the compromised agent, to create natural observations that are adversarial for its team. This paper shows mathematically the exploitation steps of such an adversarial policy in the centralized-learning and decentralized-execution paradigm of c-MARL. I presented this work at PRIMA 2022.
	Safe Reinforcement Learning via Observation Shielding Joe McCalmon, Tongtong Liu , Reid Goldsmith, Andrew Cyhaniuk, Talal Halabi, Sarra Alqahtani HICSS, 2023 [paper] [code] We proposed a method called observation-shielding RL (OSRL) to increase the robustness of RL against large perturbations using predictive models and threat detection. OSRL builds on the idea of model predictive shielding, where an observation predictive model is used to override the perturbed observations as needed to ensure safety.
	Empathetic Financial GPT (E-FinGPT): A Mixture-of-Expert Way of Building Customer Service Chatbot in Financial Domain Tongtong Liu , Mu Qiao, Divyesh Jadav IBM Research Internship, 2022 [patent] We proposed a novel mixture of expert method that combines few-shot learning and model fine-tuning on the SOTA foundation model decoder -- GPT -- to build a customer service chatbot that can respond to offensive customer complaints in a professional and empathetic way.
	LSTM-Based Anomalous Behavior Detection in Multi-Agent Reinforcement Learning Cameron Lischke, Tongtong Liu , Joe McCalmon, Md Asifur Rahman, Talal Halabi, Sarra Alqahtani IEEE CSR, 2022 [paper] [code] We present a novel stacked-LSTM ensemble approach to detect a serious vulnerability in Multi-agent Reinforcement Learning system, compromised agent attack, which one of the agent in the team is controlled by an attacker to subsequently pushes its cooperative agents to act off-policy.
	Multi-Agent Reinforcement Learning for Cooperative Adaptive Cruise Control Joe McCalmon, Ashley Peake, Benjamin Raiford, Tongtong Liu , Sarra Alqahtani ICTAI, 2020 [paper] A growing trend in the field of autonomous vehicles is the use of platooning. The design of control algorithms for platoons is challenging considering that coordination among vehicles is obtained through diverse communication channels. In this paper, we propose a multi-agent reinforcement learning approach for autonomous vehicles which communicate in a platoon formation.

Courses

Computer and Information Science (UPenn)

CIS5200: Machine Learning - A+ - Fall 2023
CIS5530: Networked System - A - Fall 2023
CIS5570: Programming For The Web - A+ - Fall 2023
CIS5050: Software Systems - A- - Spring 2024
CIS5500: Database and Information Systems - A - Spring 2024
CIS5510: Computer and Network Security - A+ - Spring 2024
CIS7000: Large Language Models - A - Fall 2024
ESE6450: Deep Generative Models - A - Fall 2024
CIS5100: Theory of Computation - Spring 2025
CIS5480: Operating Systems - Spring 2025
Head Teaching Assistant - CIS5190: Applied Machine Learning - Spring 2024, Fall 2024, Spring 2025
Teaching Assistant - CIS5200: Machine Learning - Fall 2024

Computer Science (Wake Forest)

CSC111: Introduction to Computer Science - A - Fall 2019
CSC112: Fundamentals of Computer Science - A - Spring 2020
CSC201: Data Structures and Algorithms - A - Fall 2020
CSC231: Programming Languages - A - Spring 2021
CSC250: Computer Systems I - A - Fall 2020
CSC251: Computer Systems II - A - Fall 2021
CSC301: Algorithm Design and Analysis - A - Spring 2021
CSC321: Database Management Systems - A - Spring 2022
CSC343: Internet Protocols - A - Spring 2022
CSC373: Data Mining - A - Fall 2021
CSC391: Selected Topic: Security and Trustworthiness of AI - A - Spring 2021
CSC391: Selected Topic: Cloud Computing - A - Spring 2022
BEM251: Management Information Systems - A - Spring 2022

Mathematics and Statistics (Wake Forest)

AP Credits: MST111: Calculus I
AP Credits: MST112: Calculus II
AP Credits: STA111: Elementary Probability&Stats
MST113: Multivariable Calculus - A - Fall 2022
MST117: Discrete Mathematics - A - Spring 2020
MST121: Linear Algebra - A - Fall 2019
MST253: Operations Research - A - Fall 2021
STA212: Statistical Models - A - Spring 2020
STA310: Probability - A - Spring 2022
STA362: Multivariate Statistics - A - Spring 2023
STA363: Intro to Statistical Learning - A - Fall 2022
BEM392: Seminar in Mathematical Business Analysis - A - Spring 2023

Website source code is adapted from here.