Education    Professional Experiences    Awards    Research    Courses

Tongtong (Frank) Liu

I am currently a master's student at University of Pennsylvania, School of Engineering and Applied Science, studying Computer and Information Science. I graduated from Wake Forest University in 2023 with a B.S. in Computer Science with honors and B.S. in Mathematical Business. I was previously an undergraduate researcher advised by Dr. Sarra Alqahtani and worked as a research intern at IBM Research mentored by Dr. Mu Qiao in the summer of 2022.

I am the head Teaching Assistant for UPenn's graduate level Machine Learning class. My research interests lie in Large Language Models, Generative Models, robustness and security of Reinforcement Learning, explainable Reinforcement Learning (XRL), and Machine Learning Systems & Infrastructures.

Email  /  LinkedIn  /  GitHub  /  Google Scholar

profile photo
Education
upenn University of Pennsylvania
M.S.E in Computer and Information Science
2023.08 - 2025.05
wfu Wake Forest University
B.S in Computer Science (with honors) [Honor Thesis]
B.S. in Mathematical Business
2019.08 - 2023.05
Professional Experiences
meta Meta
Menlo Park, CA
2025.06 - Present
Incoming Software Engineer
bytedance ByteDance
San Jose, CA
2024.05 - 2024.08
Software Engineer Intern
ibm IBM
San Jose, CA
2022.05 - 2022.08
Research Intern
Awards
  1. The John W. Sawyer Prize in Computer Science 2023 (for the top senior graduates in CS)
  2. Academic Excellence in Math Business Award 2023
  3. Finalist for the Computing Research Association's (CRA) Outstanding Undergraduate Researcher Award for 2023
  4. Burke M. McConnell Management Excellence Scholarship, 2022
  5. Wake Forest Research Fellowship, 2021
Research

My general research interests lie in different topics of Machine Learning, including data mining, foundation model based chatbot system in NLP, and security, safety, and robustness of Reinforcement Learning.

platoon Adversarial Behavior Exclusion for Safe Reinforcement Learning
Md Asifur Rahman, Tongtong Liu , Sarra Alqahtani
IJCAI, 2023
[paper]

Adversarial Behavior Exclusion for Safe RL (AdvEx-RL) learns a behavioral representation of the agent’s safety violations by approximating an optimal adversary utilizing exploration and later uses this representation to learn a separate safety policy that excludes those unsafe behaviors.

platoon A Policy-Graph Approach to Explain Reinforcement Learning Agents: A Novel Policy-Graph Approach with Natural Language and Counterfactual Abstractions for Explaining Reinforcement Learning Agents
Tongtong Liu , Joe McCalmon, Thai Le, Dongwon Lee, Sarra Alqahtani
AAMAS Journal, 2023
[paper]

In this work, we proposea novel approach that summarizes an agent’s policy in the form of a directed graph with natural language descriptions that help end user to understand the logic behind agent's decision.

This work is submitted to JAAMAS.

platoon Weaponizing Actions in Multi-Agent Reinforcement Learning: Theoretical and Empirical Study on Security and Robustness
Tongtong Liu, Joe McCalmon, Md Asifur Rahman, Cameron Lischke, Talal Halabi, Sarra Alqahtani
PRIMA, 2022
[paper] [code]

This paper investigates the robustness of c-MARL to a novel adversarial threat, where we target and weaponize one agent, termed the compromised agent, to create natural observations that are adversarial for its team. This paper shows mathematically the exploitation steps of such an adversarial policy in the centralized-learning and decentralized-execution paradigm of c-MARL.

I presented this work at PRIMA 2022.

platoon Safe Reinforcement Learning via Observation Shielding
Joe McCalmon, Tongtong Liu , Reid Goldsmith, Andrew Cyhaniuk, Talal Halabi, Sarra Alqahtani
HICSS, 2023
[paper] [code]

We proposed a method called observation-shielding RL (OSRL) to increase the robustness of RL against large perturbations using predictive models and threat detection. OSRL builds on the idea of model predictive shielding, where an observation predictive model is used to override the perturbed observations as needed to ensure safety.

platoon Empathetic Financial GPT (E-FinGPT): A Mixture-of-Expert Way of Building Customer Service Chatbot in Financial Domain
Tongtong Liu , Mu Qiao, Divyesh Jadav
IBM Research Internship, 2022
[patent]

We proposed a novel mixture of expert method that combines few-shot learning and model fine-tuning on the SOTA foundation model decoder -- GPT -- to build a customer service chatbot that can respond to offensive customer complaints in a professional and empathetic way.

platoon LSTM-Based Anomalous Behavior Detection in Multi-Agent Reinforcement Learning
Cameron Lischke, Tongtong Liu , Joe McCalmon, Md Asifur Rahman, Talal Halabi, Sarra Alqahtani
IEEE CSR, 2022
[paper] [code]

We present a novel stacked-LSTM ensemble approach to detect a serious vulnerability in Multi-agent Reinforcement Learning system, compromised agent attack, which one of the agent in the team is controlled by an attacker to subsequently pushes its cooperative agents to act off-policy.

platoon Multi-Agent Reinforcement Learning for Cooperative Adaptive Cruise Control
Joe McCalmon, Ashley Peake, Benjamin Raiford, Tongtong Liu , Sarra Alqahtani
ICTAI, 2020
[paper]

A growing trend in the field of autonomous vehicles is the use of platooning. The design of control algorithms for platoons is challenging considering that coordination among vehicles is obtained through diverse communication channels. In this paper, we propose a multi-agent reinforcement learning approach for autonomous vehicles which communicate in a platoon formation.

Courses

Computer and Information Science (UPenn)

  • CIS5200: Machine Learning - A+ - Fall 2023
  • CIS5530: Networked System - A - Fall 2023
  • CIS5570: Programming For The Web - A+ - Fall 2023
  • CIS5050: Software Systems - A- - Spring 2024
  • CIS5500: Database and Information Systems - A - Spring 2024
  • CIS5510: Computer and Network Security - A+ - Spring 2024
  • CIS7000: Large Language Models - A - Fall 2024
  • ESE6450: Deep Generative Models - A - Fall 2024
  • CIS5100: Theory of Computation - Spring 2025
  • CIS5480: Operating Systems - Spring 2025
  • Head Teaching Assistant - CIS5190: Applied Machine Learning - Spring 2024, Fall 2024, Spring 2025
  • Teaching Assistant - CIS5200: Machine Learning - Fall 2024

Computer Science (Wake Forest)

  • CSC111: Introduction to Computer Science - A - Fall 2019
  • CSC112: Fundamentals of Computer Science - A - Spring 2020
  • CSC201: Data Structures and Algorithms - A - Fall 2020
  • CSC231: Programming Languages - A - Spring 2021
  • CSC250: Computer Systems I - A - Fall 2020
  • CSC251: Computer Systems II - A - Fall 2021
  • CSC301: Algorithm Design and Analysis - A - Spring 2021
  • CSC321: Database Management Systems - A - Spring 2022
  • CSC343: Internet Protocols - A - Spring 2022
  • CSC373: Data Mining - A - Fall 2021
  • CSC391: Selected Topic: Security and Trustworthiness of AI - A - Spring 2021
  • CSC391: Selected Topic: Cloud Computing - A - Spring 2022
  • BEM251: Management Information Systems - A - Spring 2022

Mathematics and Statistics (Wake Forest)

  • AP Credits: MST111: Calculus I
  • AP Credits: MST112: Calculus II
  • AP Credits: STA111: Elementary Probability&Stats
  • MST113: Multivariable Calculus - A - Fall 2022
  • MST117: Discrete Mathematics - A - Spring 2020
  • MST121: Linear Algebra - A - Fall 2019
  • MST253: Operations Research - A - Fall 2021
  • STA212: Statistical Models - A - Spring 2020
  • STA310: Probability - A - Spring 2022
  • STA362: Multivariate Statistics - A - Spring 2023
  • STA363: Intro to Statistical Learning - A - Fall 2022
  • BEM392: Seminar in Mathematical Business Analysis - A - Spring 2023

Website source code is adapted from here.