Projects

Large Language Models & AI Safety

Project | Paper Title | Venue | Description | Links
LLM-DNA | LLM DNA: Tracing Model Evolution via Functional Representations | ICLR 2026 (Oral) | Training-free framework for tracing LLM evolution via functional representations | Paper, Website
LLM-Deception | Beyond Prompt-Induced Lies: Investigating LLM Deception on Benign Prompts | ICLR 2026 (Oral) | Investigating LLM deceptive behavior on benign prompts using graph connectivity problems | arXiv
DGP | DGP: A Dual-Granularity Prompting Framework for Fraud Detection with Graph-Enhanced LLMs | AAAI 2026 | Dual-Granularity Prompting Framework for fraud detection with graph-enhanced LLMs | arXiv
Llamdex | Model-based Large Language Model Customization as Service | EMNLP 2025 Main | Model-based LLM customization service - upload models instead of data | Paper
MegaAgent | MegaAgent: A Large-Scale Autonomous LLM-based Multi-Agent System Without Predefined SOPs | ACL 2025 Findings | Large-scale autonomous LLM-based multi-agent system with dynamic task decomposition | arXiv, ACL
CryptoTrade | CryptoTrade: A Reflective LLM-based Agent to Guide Zero-shot Cryptocurrency Trading | EMNLP 2024 | Reflective LLM-based agent for cryptocurrency trading with on-chain and off-chain data analysis | Paper

Federated Learning & Privacy

Project | Paper Title | Venue | Description | Links
FeT | Federated Transformer: Multi-Party Vertical Federated Learning on Practical Fuzzily Linked Data | NeurIPS 2024 | Multi-party VFL framework for fuzzy identifiers (46% accuracy improvement at 50 parties) | arXiv
LLM-PBE | LLM-PBE: Assessing Data Privacy in Large Language Models | SIGMOD 2024 (Best Paper Nomination) | Toolkit for systematic evaluation of data privacy risks in LLMs | Website
VertiBench | VertiBench: Advancing Feature Distribution Diversity in Vertical Federated Learning Benchmarks | ICLR 2024 | Benchmark for vertical federated learning with diverse feature distributions and imbalance | arXiv, Website
ModelGo | ModelGo: A Practical Tool for Machine Learning License Analysis | WWW 2024 (Oral) | License analysis tool for machine learning projects with ML-specific licensing framework | -
FedTree | FedTree: A Federated Learning System For Trees | MLSys 2023 | Federated learning system for tree-based models with HE, secure aggregation, and DP | Docs
FedGMA | Communication-Efficient Generalized Neuron Matching for Federated Learning | ICPP 2023 | Communication-efficient federated learning with generalized neuron matching | -
FedOV | Towards Addressing Label Skews in One-Shot Federated Learning | ICLR 2023 | One-shot federated learning framework addressing label skew challenges | -
FedSim | A Coupled Design of Exploiting Record Similarity for Practical Vertical Federated Learning | NeurIPS 2022 | Coupled VFL framework leveraging record similarities for improved performance | -
NIID-Bench | Federated Learning on Non-IID Data Silos: An Experimental Study | ICDE 2022 | Comprehensive FL benchmark for non-IID data with 4 algorithms and 9 datasets | -

GPU-Accelerated Machine Learning

Project | Paper Title | Venue | Description | Links
DeltaBoost | DeltaBoost: Gradient Boosting Decision Trees with Efficient Machine Unlearning | SIGMOD 2023 (Honorable Mention for Best Artifact Award) | GBDT-based model with efficient machine unlearning capability | -
ThunderSVM | ThunderSVM: A Fast SVM Library on GPUs and CPUs | JMLR 2018 | Fast SVM library on GPUs and CPUs with scikit-learn interface | Docs
ThunderGBM | Exploiting GPUs for Efficient Gradient Boosting Decision Tree Training | IEEE TPDS 2019 (Best Paper), JMLR 2020 | Fast gradient boosted trees and random forests on GPUs (10x speedup) | Docs

Graph Processing Systems

Project | Paper Title | Venue | Description | Links
RidgeWalker | RidgeWalker: Perfectly Pipelined Graph Random Walks on FPGAs | HPCA 2026 | FPGA accelerator for graph random walks with zero-bubble scheduler | -
Clementi | Clementi: Efficient Load Balancing and Communication Overlap for Multi-FPGA Graph Processing | SIGMOD 2025 | Multi-FPGA graph processing framework with near-linear scalability (1.86-8.75x speedup) | -
RUSH | RUSH: Real-time Burst Subgraph Detection in Dynamic Graphs | VLDB 2024 | Real-time fraud detection framework for dynamic graphs with burst subgraph discovery | Paper
ThunderGP | ThunderGP: Resource-Efficient Graph Processing Framework on FPGAs with HLS | ACM TRETS 2022 (Best Papers in FPGA 2021), FPGA 2021 | HLS-based graph processing framework on FPGAs (fastest on HLS-based FPGAs) | -
G3 | G3: When Graph Neural Networks Meet Parallel Graph Processing Systems on GPUs | VLDB 2020 Demo | Programmable GNN training system on GPU with graph-centric optimizations | Demo Video
Medusa | Medusa: Simplified Graph Processing on GPUs | IEEE TPDS 2013 | GPU-based parallel sparse graph processing with sequential C/C++ code | -
RICH | RICH: Real-time Identification of negative Cycles for High-efficiency arbitrage | - | Real-time negative cycle detection for arbitrage opportunities in token graphs | -

Stream Processing

Project | Paper Title | Venue | Description | Links
OEBench | OEBench: Investigating Open Environment Challenges in Real-World Relational Data Streams | VLDB 2024 | Benchmark for open environment challenges in relational data streams (55 datasets) | -
BriskStream | BriskStream: Scaling Stream Processing on Multicore Architectures | SIGMOD 2019 | Multicore, NUMA-optimized data stream processing system | arXiv
PyOE | PyOE: Python Library for Data Stream Learning | - | Machine learning library for data stream learning with support for 6 tasks | Website

Hardware Acceleration & Optimization

Project | Paper Title | Venue | Description | Links
HIPACK | HiPACK: Efficient Sub-8-Bit Direct Convolution with SIMD and Bitwise Management | MICRO 2025 | Sub-8-bit direct convolution acceleration for ARM processors (3.2x+ speedup) | -