Publications
- [DAC’25] Late Breaking Results: A Fast Nearest Neighbor Search Acceleration for 3D Point Cloud. In The 62nd Design Automation Conference, Jun 2025
- [DAC’25] Late Breaking Results: Source-Aware Adaptive Cache Management for CXL-enabled Disaggregated Memory Sharing. In The 62nd Design Automation Conference, Jun 2025
- [ICME’25] Spectral Enhanced Tuning: A Plug-and-Play Framework for Dehazing Models with Frequency Decoding and Fusion. In IEEE International Conference on Multimedia & Expo 2025, Jun 2025
- [ICCD’24] UniCoMo: A Unified Learning-Based Cost Model for Tensorized Program Tuning. In The 42nd IEEE International Conference on Computer Design, Nov 2024
- [ICCD’24] AutoSparse: A Source-to-Source Format and Schedule Auto-Tuning Framework for Sparse Tensor Program. In The 42nd IEEE International Conference on Computer Design, Nov 2024
- [FPL’24] SoGraph: A State-Aware Architecture for Out-of-Memory Graph Processing on HBM-Equipped FPGAs. In The 34th International Conference on Field-Programmable Logic and Applications, Sep 2024
- [FPL’24] Lora: A Latency-Oriented Recurrent Architecture for GPT on Multi-FPGA with Communication Optimization. In The 34th International Conference on Field-Programmable Logic and Applications, Sep 2024
- [FCCM’24] Ph.D. Project: Optimizing the Data Traffic for Large Graph Processing on FPGA via a Stateful Approach. In IEEE 32nd Annual International Symposium on Field-Programmable Custom Computing Machines, May 2024