Selected Publications
In Press
Journal Papers
- REC: Enhancing fine-grained cache coherence protocol in multi-GPU systems
- Gun Ko, Jiwon Lee, Hongju Kal, Hyunwuk Lee, and Won Woo Ro
Journal of Systems Architecture Vol. 160, Mar. 2025 
(IF: 4.5, Q1, JCR2022)
- HashScape: Leveraging Virtual Address Dynamics for Efficient Hashed Page Tables
- Won Hur, Jiwon Lee, Jaewon Kwon, Minjae Kim, and Won Woo Ro
IEEE Transactions on Computers 2025 
(IF: 3.6, Q2, JCR2023)
Conference Papers
- Ditto: Accelerating Diffusion Model via Temporal Value Similarity
- Sungbin Kim*, Hyunwuk Lee*, Wonho Cho, Mincheol Park, and Won Woo Ro
The 31st IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2025   (IF: 4, NRF BK21four)
- Marching Page Walks: Batching and Concurrent Page Table Walks for Enhancing GPU Throughput
- Jiwon Lee, Gun Ko, Myung Kuk Yoon, Ipoom Jeong, Yunho Oh, and Won Woo Ro
The 31st IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2025   (IF: 4, NRF BK21four)
- Qubit Movement-Optimized Program Generation on Zoned Neutral Atom Processors
- Enhyeok Jang, Youngmin Kim, Hyungseok Kim, Seungwoo Choi, Yipeng Huang, and Won Woo Ro
- The IEEE/ACM International Symposium on Code Generation and Optimization (CGO), 2025   (IF: 2, NRF BK21four)
Journal Papers
- SHREG: Mitigating Register Redundancy in GPUs
- Seunghyun Jin, Hyunwuk Lee, Jonghyun Lee, Junsung Kim, and Won Woo Ro
Journal of Systems Architecture Vol. 152, Jul. 2024 
(IF: 4.5, Q1, JCR2022)
Conference Papers
- DEPrune: Depth-wise Separable Convolution Pruning for Maximizing GPU Parallelism
- Cheonjun Park, Mincheol Park, Hyunchan Moon, Myung Kuk Yoon, Seokjin Go, Suhyun Kim, and Won Woo Ro
The 38th Annual Conference on Neural Information Processing Systems (NeurIPS), 2024   (IF: 4, NRF BK21four, Acceptance Rate: 25.8%)
- AirGun: Adaptive Granularity Quantization for Accelerating Large Language Models
- Sungbin Kim, Hyunwuk Lee, Sungwoo Kim, Cheolhwan Kim, and Won Woo Ro
- The International Conference on Computer Design (ICCD), 2024   (IF: 1, NRF BK21four, Acceptance Rate: 28%)
- MOSQ: Accelerating Classical Simulation of UCCSD Ansatz Circuits using Merged Operation
- Seungwoo Choi, Enhyeok Jang, Youngmin Kim, and Won Woo Ro
- The International Conference on Computer Design (ICCD), 2024   (IF: 1, NRF BK21four, Acceptance Rate: 28%)
- Generalizing Ray Tracing Accelerators for Tree Traversals on GPUs
- Dongho Ha*, Lufei Liu*, Yuan Hsi Chou, Seokjin Go, Won Woo Ro, Hung-Wei Tseng, and Tor M. Aamodt
The International Symposium on Microarchitecture (MICRO), 2024   (IF: 4, NRF BK21four, Acceptance Rate: 22.7%)
- Barber: Balancing Thermal Relaxation Deviations of NISQ Programs by Exploiting Bit-Inverted Circuits
- Enhyeok Jang, Seungwoo Choi, Youngmin Kim, Jeewoo Seo, and Won Woo Ro
The 2024 ACM/IEEE International Conference on Computer-Aided Design (ICCAD), 2024   (IF: 3, NRF BK21four, Acceptance Rate: 24%)
- Recompiling QAOA Circuits on Various Rotational Directions
- Enhyeok Jang, Dongho Ha, Seungwoo Choi, Youngmin Kim, Jaewon Kwon, Yongju Lee, Sungwoo Ahn, Hyungseok Kim, and Won Woo Ro
The 33rd International Conference on Parallel Architectures and Compilation Techniques (PACT), 2024   (IF: 3, NRF BK21four)
- M3XU: Achieving High-Precision and Complex Matrix Multiplication with Low-Precision MXUs
- Dongho Ha, Yunan Zhang, Chen-Chien Kao, Christopher J. Hughes, Won Woo Ro, and Hung-Wei Tseng
The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), 2024   (IF: 3, NRF BK21four, Acceptance Rate: 22.7%)
- GUMSO: Gating Unnecessary On-Chip Memory Slices for Power Optimization on GPUs
- Seunghyun Jin, Hyunwuk Lee, and Won Woo Ro
- The 2024 ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED), 2024   (IF: 1, NRF BK21four)
- Geneva: A Dynamic Confluence of Speculative Execution and In-Order Commitment Windows
- Yanghee Lee, Jiwon Lee, Jaewon Kwon, Yongju Lee, and Won Woo Ro
The 61st Design Automation Conference (DAC), 2024   (IF: 3, NRF BK21four, Acceptance Rate: 23%)
- REPrune: Channel Pruning via Kernel Representative Selection
- Mincheol Park, Dongjin Kim, Cheonjun Park, Yuna Park, Gyeong Eun Gong, Won Woo Ro, and Suhyun Kim
The 38th AAAI Conference on Artificial Intelligence (AAAI), 2024   (IF: 4, NRF BK21four, Acceptance Rate: 23.7% [2342/12100])
Journal Papers
- A Convertible Neural Processor Supporting Adaptive Quantization for Real-Time Neural Networks
- Hongju Kal, Hyoseong Choi, Ipoom Jeong, Joon-Sung Yang, and Won Woo Ro
Journal of Systems Architecture Vol. 145, Nov. 2023 
(IF: 4.5, Q1, JCR2022)
Conference Papers
- INTERPRET: Inter-Warp Register Reuse for GPU Tensor Cores
- Jae Seok Kwak, Myung Kuk Yoon, Ipoom Jeong, Seunghyun Jin, and Won Woo Ro
The 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT), 2023   (IF: 3, NRF BK21four)
- McCore: A Holistic Management of High-Performance Heterogeneous Multicores
- Jaewon Kwon, Yongju Lee, Hongju Kal, Minjae Kim, Youngsok Kim, and Won Woo Ro
The 56th International Symposium on Microarchitecture (MICRO), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 23.8% [101/424])
- AESPA: Asynchronous Execution Scheme to Exploit Bank-Level Parallelism of Processing-in-Memory
- Hongju Kal, Chanyoung Yoo, and Won Woo Ro
The 56th International Symposium on Microarchitecture (MICRO), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 23.8% [101/424])
- MAD MAcce: Supporting Multiply-Add Operations for Democratizing Matrix-Multiplication Accelerator
- Seunghwan Sung, Sujin Hur, Dongho Ha, Sungwoo Kim, Yunho Oh, and Won Woo Ro
The 56th International Symposium on Microarchitecture (MICRO), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 23.8% [101/424])
- Exploiting Inherent Properties of Complex Numbers for Accelerating Complex Valued Neural Networks
- Hyunwuk Lee, Hyungjun Jang, Sungbin Kim, Sungwoo Kim, Wonho Cho, and Won Woo Ro
The 56th International Symposium on Microarchitecture (MICRO), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 23.8% [101/424])
- TensorCV: Accelerating Inference-Adjacent Computation Using Tensor Processors
- Dongho Ha, Won Woo Ro, and Hung-Wei Tseng
- The 2023 ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED), 2023   (IF: 1, NRF BK21four)
- R2D2: Removing ReDunDancy Utilizing Linearity of Address Generation in GPUs
- Dongho Ha, Yunho Oh, and Won Woo Ro
The 50th International Symposium on Computer Architecture (ISCA), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 21.2% [79/372])
- Early-Adaptor: An Adaptive Framework for Proactive UVM Memory Management
- Seokjin Go, Hyunwuk Lee, Junsung Kim, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
- The 2023 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023   (IF: 1, NRF BK21four, Acceptance Rate: 37.8%)
- Lightning Talk: Efficiency and Programmability of DNN Accelerators and GPUs
- Won Woo Ro
The 60th Design Automation Conference (DAC), 2023   (IF: 3, NRF BK21four, Acceptance Rate: 23%)
- Quixote: Improving Fidelity of Quantum Program by Independent Execution of Controlled Gates
- Enhyeok Jang, Seungwoo Choi, and Won Woo Ro
The 60th Design Automation Conference (DAC), 2023   (IF: 3, NRF BK21four, Acceptance Rate: 23%)
- Balanced Column-Wise Block Pruning for Maximizing GPU Parallelism
- Cheonjun Park, Mincheol Park, Hyun Jae Oh, Minkyu Kim, Myung Kuk Yoon, Suhyun Kim, and Won Woo Ro
The 37th AAAI Conference on Artificial Intelligence (AAAI), 2023   (IF: 4, NRF BK21four, Oral Acceptance Rate: 10.8% [952/8777], Oral Presentation)
- SnakeByte: A TLB Design with Adaptive and Recursive Page Merging in GPUs
- Jiwon Lee, Ju Min Lee, Yunho Oh, William J. Song, and Won Woo Ro
The 29th IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 25.0% [91/364])
Journal Papers
- TEA-RC: Thread Context-Aware Register Cache for GPUs
- Ipoom Jeong, Yunho Oh, Won Woo Ro, and Myung Kuk Yoon
IEEE Access  
(IF: 3.476, Q2, JCR2021)
- CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs
- Yunho Oh, Ipoom Jeong, Won Woo Ro, and Myung Kuk Yoon
IEEE Embedded Systems Letters  
(IF: 2.169, Q2, JCR2020)
- FLIXR: Embedding Index into Flash Translation Layer in SSDs
- Gunjae Koo, Yunho Oh, Hung-Wei Tseng, Won Woo Ro, and Murali Annavaram
IEEE Transactions on Computers, doi: 10.1109/TC.2022.3154602, Feb. 2022  
(IF: 2.663, Q2, JCR2020)
Conference Papers
- Reconstructing Out-of-Order Issue Queue
- Ipoom Jeong, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
The 55th IEEE/ACM International Symposium on Microarchitecture (MICRO), 2022   (IF: 4, NRF BK21four, Acceptance Rate: 23.8% [83/348])
Journal Papers
- Two-Stage In-Storage Processing and Scheduling for Pattern Matching Applications
- Joohyeong Yoon, Yoonjin Lee, Won Seob Jeong, and Won Woo Ro
IEEE Access, Vol. 9, pp. 95702-95715, Jun. 2021  
(IF: 3.367, Q1, JCR2020)
- PIMCaffe: Functional Evaluation of a Machine Learning Framework for In-Memory Neural Processing Unit
- Won Jeon, Jiwon Lee, Dongseok Kang, Hongju Kal, and Won Woo Ro
IEEE Access, Vol. 9, pp. 96629-96640, Jul. 2021  
(IF: 3.367, Q1, JCR2020)
Conference Papers
- SPACE: Locality-Aware Processing in Heterogeneous Memory for Personalized Recommendations
- Hongju Kal, Seokmin Lee, Gun Ko, and Won Woo Ro
The 48th ACM/IEEE International Symposium on Computer Architecture (ISCA), 2021   (IF: 4, NRF BK21four, Acceptance Rate: 18.7% [76/406])
Journal Papers
- Hi-End: Hierarchical, Endurance-Aware STT-MRAM-Based Register File for Energy-Efficient GPUs
- Won Jeon, Jun Hyun Park, Yoonsoo Kim, Gunjae Koo, and Won Woo Ro
IEEE Access, Vol. 8, pp. 127768-127780, Jul. 2020  
(IF: 3.745, Q1, JCR2019)
- REACT: Scalable and High-Performance Regular Expression Pattern Matching Accelerator for In-Storage Processing
- Won Seob Jeong, Changmin Lee, Keunsoo Kim, Myung Kuk Yoon, Won Jeon, Myoungsoo Jung, and Won Woo Ro
IEEE Transactions on Parallel and Distributed Systems, Vol. 31, Issue 5, pp. 1137-1151, May 2020  
(IF: 3.402, Q1, JCR2018)
Conference Papers
- Duplo: Lifting Redundant Memory Accesses of Neural Networks for GPU Tensor Cores
- Hyeonjin Kim, Sungwoo Ahn, Yunho Oh, Bogil Kim, Won Woo Ro, and William J. Song
The 53rd IEEE/ACM International Symposium on Microarchitecture (MICRO), 2020   (IF: 4, NRF BK21four, Acceptance Rate: 19.4% [82/422])
- Check-In: In-Storage Checkpointing for Key-Value Store System Leveraging Flash-Based SSDs
- Joohyeong Yoon, Won Seob Jeong, and Won Woo Ro
The 47th ACM/IEEE International Symposium on Computer Architecture (ISCA), 2020   (IF: 4, NRF BK21+, Acceptance Rate: 18.2% [77/421])
- CASINO Core Microarchitecture: Generating Out-of-Order Schedules Using Cascaded In-Order Scheduling Windows
- Ipoom Jeong, Seihoon Park, Changmin Lee, and Won Woo Ro
The 26th IEEE International Symposium on High Performance Computer Architecture (HPCA), 2020   (IF: 4, NRF BK21+, Acceptance Rate: 16.9% [48/284])
Journal Papers
- OverCome: Coarse-Grained Instruction Commit with Handover Register Renaming
- Ipoom Jeong, Changmin Lee, Keunsoo Kim, and Won Woo Ro
IEEE Transactions on Computers, Vol. 68, Issue 12, pp. 1802-1816, Dec. 2019
(IF: 3.131, Q1, JCR2018)
- Contents-Aware Partitioning Algorithm for Parallel High Efficiency Video Coding
- Kyungah Kim and Won Woo Ro
- Multimedia Tools and Applications, Vol. 78, Issue 9, pp. 11427-11442, May 2019
(IF: 2.101, Q3, JCR2018)
- Fast CU Depth Decision for HEVC using Neural Networks
- Kyungah Kim and Won Woo Ro
IEEE Transactions on Circuits and Systems for Video Technology, Vol. 29, No. 5, pp. 1462-1473, May 2019
(IF: 4.046, Q1, JCR2018)
- Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs
- Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Murali Annavaram, and Won Woo Ro
IEEE Transactions on Computers, Vol. 68, No. 4, pp. 609-616, Apr. 2019
(IF: 3.131, Q1, JCR2018)
Conference Papers
- Efficient Dilated-Winograd Convolutional Neural Networks
- Minsik Kim, Cheonjun Park, Sungjun Kim, Taeyoung Hong, and Won Woo Ro
- The 2019 IEEE International Conference on Image Processing (ICIP), 2019 Taipei, Taiwan, Sep. 22 - 25 (Acceptance Rate: 46.2% [956/2068])
- Linebacker: Preserving Victim Cache Lines in Idle Register Files of GPUs
- Yunho Oh, Gunjae Koo, Murali Annavaram, and Won Woo Ro
The 46th ACM/IEEE International Symposium on Computer Architecture (ISCA), 2019 Phoenix, Arizona, USA, Jun. 22 - 26 (IF: 4, NRF BK21four, Acceptance Rate: 17.0% [62/365])
Journal Papers
- WASP: Selective Data Prefetching with Monitoring Runtime Warp Progress on GPUs
- Yunho Oh, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, and Won Woo Ro
IEEE Transactions on Computers, Vol. 67, No. 9, pp. 1366-1373, Sep. 2018
(IF: 3.052, Q1, JCR2017)
- Exploiting Pseudo-Quadtree Structure for Accelerating HEVC Spatial Resolution Downscaling Transcoder
- Minsik Kim, Minyong Sung, Minwoo Kim, and Won Woo Ro
IEEE Transactions on Multimedia, Vol. 20, No. 9, pp. 2262-2275, Sep. 2018
(IF: 3.977, Q1, JCR2017)
- Architectural Protection of Application Privacy against Software and Physical Attacks in Untrusted Cloud Environment
- Lei Xu, JongHyuk Lee, Seung Hun Kim, Qingji Zheng, Shouhuai Xu, Taeweon Suh, Won Woo Ro, and Weidong Shi
IEEE Transactions on Cloud Computing, Vol. 6, No. 2, pp. 478-491, Apr-Jun. 2018
(IF: 7.928, Q1, JCR2017)
- Simultaneous and Speculative Thread Migration for Improving Energy Efficiency of Heterogeneous Core Architectures
- Changmin Lee and Won Woo Ro
IEEE Transactions on Computers, Vol. 67, No. 4, pp. 498-512, Apr. 2018
(IF: 3.052, Q1, JCR2017)
Conference Papers
- FineReg: Fine-Grained Register File Management for Augmenting GPU Throughput
- Yunho Oh, Myung Kuk Yoon, William J. Song, and Won Woo Ro
- The 51st IEEE/ACM International Symposium on Microarchitecture
(MICRO 2018) Fukuoka, Japan, Oct. 20 - 24, 2018
(IF: 4, NRF BK21+, Acceptance Rate: 21.1% [74/351])
- WIR: Warp Instruction Reuse to Minimize Repeated Computations in GPUs
- Keunsoo Kim and Won Woo Ro
- The 24th IEEE International Symposium on High Performance Computer Architecture
(HPCA 2018) Vienna, Austria, Feb. 24 - 28, 2018
(IF: 4, NRF BK21+, Acceptance Rate: 20.8% [54/260])
Journal Papers
- Dynamic Resizing on Active Warps Scheduler to Hide Operation Stalls on GPUs
- Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
IEEE Transactions on Parallel and Distributed Systems, Vol. 28, No. 11, pp. 3142-3156, Nov. 2017 (IF: 4.181, Q1, JCR2016)
- Dynamic Load Balancing of Dispatch Scheduling for Solid State Disks
- Myunghyun Jo and Won Woo Ro
IEEE Transactions on Computers, Vol. 66, No. 6, pp. 1034-1047, Jun. 2017 (IF: 2.916, Q1, JCR2016)
- Improving Energy Efficiency of GPUs through Data Compression and Compressed Execution
- Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Murali Annavaram, and Won Woo Ro
IEEE Transactions on Computers, Vol. 66, No. 5, pp. 834-847, May 2017 (IF: 2.916, Q1, JCR2016)
Conference Papers
- Access Pattern-Aware Cache Management for Improving Data Utilization in GPU
- Gunjae Koo, Yunho Oh, Won Woo Ro, and Murali Annavaram
- The 44th ACM/IEEE International Symposium on Computer Architecture
(ISCA 2017) Toronto, Canada, Jun. 24 - 28, 2017 (IF: 4, NRF BK21+, Acceptance Rate: 16.8% [54/322])
Journal Papers
- Server Side, Play Buffer Based Quality Control for Adaptive Media Streaming
- Keunsoo Kim, Benjamin Y. Cho, and Won Woo Ro
- Multimedia Tools and Applications, Vol. 75, No. 10, pp. 5397-5415, May 2016 (IF: 1.331, Q2, JCR2015)
- Exploiting Thread-Level Parallelism on HEVC by Employing Reference Dependency Graph
- Minwoo Kim, Deokho Kim, Kyungah Kim, and Won Woo Ro
IEEE Transactions on Circuits and Systems for Video Technology, Vol. 26, No. 4, pp. 736-749, Apr. 2016 (IF: 2.254, Q1, JCR2015)
- Parallel GPU Architecture Simulation Framework Exploiting Architectural-Level Parallelism with Timing Error Prediction
- Sangpil Lee and Won Woo Ro
IEEE Transactions on Computers, Vol. 65, No. 4, pp. 1253-1265, Apr. 2016 (IF: 1.723, Q1, JCR2015)
Conference Papers
- Virtual Thread: Maximizing Thread-Level Parallelism beyond GPU Scheduling Limit
- Myung Kuk Yoon, Keunsoo Kim, Sangpil Lee, Won Woo Ro, and Murali Annavaram
- The 43rd ACM/IEEE International Symposium on Computer Architecture
(ISCA 2016) Seoul, Korea, Jun. 18 - 22, 2016 (IF: 4, NRF BK21+)
- APRES: Improving Cache Efficiency by Exploiting Load Characteristics on GPUs
- Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Won Woo Ro, and Murali Annavaram
- The 43rd ACM/IEEE International Symposium on Computer Architecture
(ISCA 2016) Seoul, Korea, Jun. 18 - 22, 2016 (IF: 4, NRF BK21+)
- Warped-Slicer: Efficient Intra-SM Slicing through Dynamic Resource Partitioning for GPU Multiprogramming
- Qiumin Xu, Hyeran Jeon, Keunsoo Kim, Won Woo Ro, and Murali Annavaram
- The 43rd ACM/IEEE International Symposium on Computer Architecture
(ISCA 2016) Seoul, Korea, Jun. 18 - 22, 2016 (IF: 4, NRF BK21+)
- Warped-Preexecution: A GPU Pre-execution Approach for Improving Latency Hiding
- Keunsoo Kim, Sangpil Lee, Myung Kuk Yoon, Gunjae Koo, Won Woo Ro, and Murali Annavaram
- The 22nd IEEE International Symposium on High Performance Computer Architecture
(HPCA 2016) Barcelona, Spain, Mar. 12 - 16, 2016 (IF: 4, NRF BK21+)
Journal Papers
- A Performance-Energy Model to Evaluate Single Thread Execution Acceleration
- Seung Hun Kim, Dohoon Kim, Changmin Lee, Won Seob Jeong, Won Woo Ro, and Jean-Luc Gaudiot
- IEEE Computer Architecture Letters, Vol. 14, No. 2, pp. 99-102, Dec. 2015 (IF: 0.677, Q3, JCR2014)
- Dynamic Load Balancing of Parallel SURF with Vertical Partitioning
- Deokho Kim, Minwoo Kim, Kyungah Kim, Minyong Sung, and Won Woo Ro
IEEE Transactions on Parallel and Distributed Systems, Vol. 26, No. 12, pp. 3358-3370, Dec. 2015 (IF: 2.170, Q1, JCR2014)
- Network Variation and Fault Tolerant Performance Acceleration in Mobile Devices with Simultaneous Remote Execution
- Keunsoo Kim, Benjamin Y. Cho, Won Woo Ro, and Jean-Luc Gaudiot
IEEE Transactions on Computers, Vol. 64, No. 10, pp. 2862-2874, Oct. 2015 (IF: 1.659, Q1, JCR2014)
- Highly Secure Mobile Devices Assisted with Trusted Cloud Computing Environments
- Doohwan Oh, Ilkyu Kim, Keunsoo Kim, Sang-Min Lee, and Won Woo Ro
- ETRI Journal, Vol. 37, No. 2, pp. 348-358, Apr. 2015 (IF: 0.771, Q3, JCR2014)
Conference Papers
- True Motion Compensation With Feature Detection for Frame Rate Up-Conversion
- Kyungah Kim, Minwoo Kim, Deokho Kim, and Won Woo Ro
- The 2015 IEEE International Conference on Image Processing
- (ICIP 2015) Quebec City, Canada, Sep. 27 - 30, 2015
- An Accelerated Separable Median Filter with Sorting Networks
- Minsik Kim, Deokho Kim, Minyong Sung, and Won Woo Ro
- The 2015 IEEE International Conference on Image Processing
- (ICIP 2015) Quebec City, Canada, Sep. 27 - 30, 2015
- Enhancing Software Dependability and Security with Hardware Supported Instruction Address Space Randomization
- Seung Hun Kim, Lei Xu, Ziyi Liu, Zhiqiang Lin, Won Woo Ro, and Weidong Shi
- The 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks
- (DSN 2015) Rio de Janeiro, Brazil, Jun. 22 - 25, 2015
- Warped-Compression: Enabling Power Efficient GPUs through Register Compression
- Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Won Woo Ro, and Murali Annavaram
- The 42nd ACM/IEEE International Symposium on Computer Architecture
(ISCA 2015) Portland, OR, USA, Jun. 13 - 17, 2015 (IF: 4, NRF BK21+)
- DRAW: Investigating Benefits of Adaptive Fetch Group Size on GPU
- Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
- The 2015 IEEE International Symposium on Performance Analysis of Systems and Software
- (ISPASS 2015) Philadelphia, PA, USA, Mar. 29 - 31, 2015
Journal Papers
- A Malicious Pattern Detection Engine for Embedded Security Systems in Internet of Things
- Doohwan Oh, Deokho Kim, and Won Woo Ro
- Sensors, Vol. 14, No. 12, pp. 24188-24211, Dec. 2014 (IF: 2.048, Q2, JCR2013)
- C-Lock: Energy Efficient Synchronization for Embedded Multicore Systems
- Seung Hun Kim, Sang Hyong Lee, Minje Jun, Byunghoon Lee, Won Woo Ro, Eui-Young Chung, and Jean-Luc Gaudiot
- IEEE Transactions on Computers, Vol. 63, No. 8, pp. 1962-1974, Aug. 2014 (IF: 1.473, Q2, JCR2013)
- Swarm Processor System: Hardware Process Scheduler based Energy Efficient Multi-Core System
- Won Seob Jeong, Seung Hun Kim, Sang-Min Lee, and Won Woo Ro
- IEICE Electronics Express, Vol. 11, No. 14, pp. 20140424, July 2014 (IF: 0.391, Q4, JCR2013)
- Complexity-Effective Contention Management with Dynamic Backoff for Transactional Memory Systems
- Seung Hun Kim, Dongmin Choi, Won Woo Ro, and Jean-Luc Gaudiot
- IEEE Transactions on Computers, Vol. 63, No. 7, pp. 1696-1708, July 2014 (IF: 1.473, Q2, JCR2013)
- Architectural Investigation of Matrix Data Layout on Multicore Processors
- Minwoo Kim and Won Woo Ro
Future Generation Computer Systems, Vol. 37, pp. 64-75, July 2014 (IF: 2.639, Q1, JCR2013)
- Exploiting Implementation Diversity and Partial Connection of Routers in Application-Specific Network-on-Chip Topology Synthesis
- Minje Jun, Won Woo Ro, and Eui-Young Chung
- IEEE Transactions on Computers, Vol. 63, No. 6, pp. 1434-1445, Jun. 2014 (IF: 1.473, Q2, JCR2013)
- Accelerating MapReduce Framework on Multi-GPU Systems
- Hai Jiang, Yi Chen, Zhi Qiao, Kuan-Ching Li, Won Woo Ro, and Jean-Luc Gaudiot
- Cluster Computing, Vol. 17, No. 2, pp. 293-301, Jun. 2014 (IF: 0.949, Q3, JCR2013)
- Boosting CUDA Applications with CPU-GPU Hybrid Computing
- Changmin Lee, Won Woo Ro, and Jean-Luc Gaudiot
- International Journal of Parallel Programming, Vol. 42, No. 2, pp. 384-404, Apr. 2014 (IF: 0.500, Q4, JCR2013)
- This is an extension of our INTERACT-16 paper which has been selected as one of the best papers and recommended to IJPP.
Conference Papers
- LUT based Secure Cloud Computing - an Implementation using FPGAs
- Lei Xu, Pham Dang Khoa, Seung Hun Kim, Won Woo Ro, and Weidong Shi
- 2014 International Conference on ReConFigurable Computing and FPGAs
- (ReConFig 2014) Cancun, Mexico, Dec. 7 - 10, 2014
- Workload Synthesis: Generating Benchmark Workloads from Statistical Execution Profile
- Keunsoo Kim, Changmin Lee, Jung Ho Jung, and Won Woo Ro
- IEEE International Symposium on Workload Characterization
- (IISWC 2014) Raleigh, North Carolina, USA, Oct. 26 - 28, 2014
Journal Papers
- Parallelized Sub-Resource Loading for Web Rendering Engine
- Deokho Kim, Changmin Lee, Sangpil Lee, and Won Woo Ro
- Journal of Systems Architecture, Vol. 59, No. 9, pp. 785-793, Oct. 2013 (IF: 0.724, Q3, JCR2012)
- Design and Evaluation of Random Linear Network Coding Accelerators on FPGAs
- Sunwoo Kim, Won Seob Jeong, Won Woo Ro, and Jean-Luc Gaudiot
- ACM Transactions on Embedded Computing Systems, Vol. 13, No. 1, pp. 1-24, Aug. 2013 (IF: 1.178, Q2, JCR2012)
- GPU-Friendly Parallel Genome Matching with Tiled Access and Reduced State Transition Table
- Yunho Oh, Doohwan Oh, and Won Woo Ro
- International Journal of Parallel Programming, Vol. 41, No. 4, pp. 526-551, Aug. 2013 (IF: 0.404, Q4, JCR2012)
- A Distributed Signature Detection Method for Detecting Intrusions in Sensor Systems
- Ilkyu Kim, Doohwan Oh, Myung Kuk Yoon, Kyueun Yi, and Won Woo Ro
- Sensors, Vol. 13, No. 4, pp. 3998-4016, Mar. 2013 (IF: 1.953, Q3, JCR2012)
- Exploiting SIMD Parallelism on Dynamically Partitioned Parallel Network Coding for P2P Systems
- Deokho Kim, Karam Park, and Won Woo Ro
- Computers & Electrical Engineering, Vol. 39, No. 1, pp. 55-56, Jan. 2013 (IF: 0.928, Q3, JCR2012)
- Benefits of Using Parallelized Non-Progressive Network Coding
- Minwoo Kim, Karam Park, and Won Woo Ro
Journal of Network and Computer Applications, Vol. 36, No. 1, pp. 293-305, Jan. 2013 (IF: 1.467, Q1, JCR2012)
- Importance of Coherence Protocols with Network Applications on Multi-Core Processors
- Kyueun Yi, Won Woo Ro, and Jean-Luc Gaudiot
- IEEE Transactions on Computers, Vol. 62, No. 1, pp. 6-15, Jan. 2013 (IF: 1.379, Q2, JCR2012)
Conference Papers
- XSD: Accelerating MapReduce by Harnessing the GPU inside an SSD
- Benjamin Y. Cho, Won Seob Jeong, Doohwan Oh, and Won Woo Ro
- The 1st Workshop on Near-Data Processing. In conjunction with the MICRO-46
- (WoNDP 2013) Davis, USA, Dec. 8, 2013
- Mark-Sharing: A Parallel Garbage Collection Algorithm for Low Synchronization Overhead
- Hyunkyu Park, Changmin Lee, Seung Hun Kim, Won Woo Ro, and Jean-Luc Gaudiot
- The 19th IEEE International Conference on Parallel and Distributed Systems
- (ICPADS 2013) Seoul, Korea, Dec. 15 - 18, 2013
- MGMR: Multi-GPU Based MapReduce
- Yi Chen, Zhi Qiao, Hai Jiang, Kuan-Ching Li, and Won Woo Ro
- The 8th International Conference on Grid and Pervasive Computing
- (GPC 2013) Seoul, Korea, May 9 - 11, 2013
- Parallel GPU Architecture Simulation Framework Exploiting Work Allocation Unit Parallelism
- Sangpil Lee and Won Woo Ro
- The 2013 IEEE International Symposium on Performance Analysis of Systems and Software
- (ISPASS 2013) Austin, TX, USA, Apr. 21 - 23, 2013
Journal Papers
- Multi-Threading and Suffix Grouping on Massive Multiple Pattern Matching Algorithm
- Doohwan Oh and Won Woo Ro
- The Computer Journal, Vol. 55, No. 11, pp. 1331-1346, Nov. 2012 (IF: 0.785, Q3, JCR2011)
- Offloading of Media Transcoding for High-Quality Multimedia Services
- Seung Hun Kim, Keunsoo Kim, Changmin Lee, and Won Woo Ro
- IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, pp. 691-699, May 2012 (IF: 0.941, Q3, JCR2011)
- Design of a Power-Efficient Parallel Pipelined Bloom Filter
- Deokho Kim, Doohwan Oh, and Won Woo Ro
- Electronics Letters, Vol. 48, No. 7, pp. 367-369, Mar. 2012 (IF: 0.965, Q3, JCR2011)
- Reconfigurable and Parallelized Network Coding Decoder for VANETs
- Sunwoo Kim and Won Woo Ro
Mobile Information Systems, Vol. 8, No. 1, pp. 45-59, Feb. 2012 (IF: 2.432, Q1, JCR2011)
- Accelerated Network Coding with Dynamic Stream Decomposition on Graphics Processing Unit
- Sangpil Lee and Won Woo Ro
- The Computer Journal, Vol. 55, No. 1, pp. 21-34, Jan. 2012 (IF: 0.785, Q3, JCR2011)
Conference Papers
- Conflict Avoidance Scheduling using Grouping List for Transactional Memory
- Dongmin Choi, Seung Hun Kim, and Won Woo Ro
- The 17th International Workshop on High-Level Parallel Programming Models and Supportive Environments
- (HIPS-17) Shanghai, China, May 21, 2012
- Cooperative Heterogeneous Computing for Parallel Processing on CPU/GPU Hybrids
- Changmin Lee, Won Woo Ro, and Jean-Luc Gaudiot
- The 16th Workshop on Interaction between Compilers and Computer Architectures
- (INTERACT-16) New Orleans, USA, Feb. 25 - 29, 2012
Journal Papers
- A Novel Sequential Tree Algorithm Based on Scoreboard for MPI Broadcast Communication
- Won-young Chung, Jae-won Park, Seung-Woo Lee, Won Woo Ro, and Yong-surk Lee
- IEICE Transactions on Information and Systems, Vol. 94, No. 12, pp. 2523-2527, Dec. 2011 (IF: 0.268, Q4, JCR2010)
- Network Coding on Heterogeneous Multi-Core Processors for Wireless Sensor Networks
- Deokho Kim, Karam Park, and Won W. Ro
- Sensors, Vol. 11, No. 8, pp. 7908-7933, Aug. 2011 (IF: 1.774, Q3, JCR2010)
- A Low-Cost Standard Mode MPI Hardware Unit for Embedded MPSoC
- Won-Young Chung, Ha-Young Jeong, Won W. Ro, and Yong-Surk Lee
- IEICE Transactions on Information and Systems, Vol. E94-D, No. 7, pp. 1497-1501, July 2011 (IF: 0.268, Q4, JCR2010)
Conference Papers
- Parallel Transpose of Matrix Multiplication Based on the Tiling Algorithm
- Minwoo Kim, Yong J. Jang, and Won W. Ro
- The 54th IEEE International Midwest Symposium on Circuits and Systems
- (MWSCAS 2011) Seoul, Korea, Aug. 7 - 10, 2011
- Performance Evaluation of Adaptive Progressive Network Coding
- Deokho Kim, Karam Park, and Won W. Ro
- The 54th IEEE International Midwest Symposium on Circuits and Systems
- (MWSCAS 2011) Seoul, Korea, Aug. 7 - 10, 2011
Journal Papers
- Multithreaded Pattern Matching Algorithm with Data Rearrangement
- Doohwan Oh, Seung Hun Kim, and Won W. Ro
- IEICE Electronics Express, Vol. 7, No. 20, pp. 1520-1526, Oct. 2010 (IF: 0.510, Q3, JCR2009)
- On Improving Parallelized Network Coding with Dynamic Partitioning
- Karam Park, Joon-Sang Park, and Won W. Ro
IEEE Transactions on Parallel and Distributed Systems, Vol. 21, No. 11, pp. 1547-1560, Nov. 2010 (IF: 1.733, Q1, JCR2009)
- Hardware Implementation of a Tessellation Accelerator for the OpenVG Standard
- Seung Hun Kim, Yunho Oh, Karam Park, and Won W. Ro
- IEICE Electronics Express, Vol. 7, No. 6, pp. 440-446, Mar. 2010 (IF: 0.510, Q3, JCR2009)
Conference Papers
- Development of Virtual CUDA Systems of Parallel Processing on CPU and GPGPU
- Doohwan Oh, Sangpil Lee, Deokho Kim, Changmin Lee, and Won W. Ro
- Workshop on Micro Architectural Support for Virtualization, Data Center Computing, and Clouds In Conjunction with MICRO 2010
- (MASVDC Workshop 2010) Atlanta, USA, Dec. 5, 2010
- Implementing FFT using SPMD style of OpenMP
- Tien-Hsiung Weng, Sheng-Wei Huang, Won Woo Ro, and Kuan-Ching Li
- In Proc. of the 6th International Conference on Networked Computing and Advanced Information Management
- (NCM 2010) Seoul, Korea, Aug. 16 - 18, 2010
- Accelerated Reconstruction Using Parallel Computing for Spiral Spectroscopic Imaging
- Dong H. Kim, Yun H. Oh, Yun H. Nam, M. Gu, and Won W. Ro
- In Proc. of 2010 International Society for Magnetic Resonance in Medicine Annual Meeting
- (2010 ISMRM Annual Meeting) Stockholm, Sweden, May 1 - 7, 2010
- FPGA Implementation of Highly Parallelized Decoder Logic for Network Coding
- Sunwoo Kim and Won W. Ro
- In Proc. of Eighteenth ACM/SIGDA International Symposium on Field-Programmable Gate Arrays
- (FPGA 2010) Monterey, USA, Feb. 21 - 23, 2010
Journal Papers
- A Complexity-Effective Microprocessor Design with Decoupled Dispatch Queues and Prefetching
- Won W. Ro and Jean-Luc Gaudiot
- Parallel Computing, Vol. 35, No. 5, pp. 255-268, May 2009 (IF: 1.309, Q2, JCR2008)
Conference Papers
- Evaluation of Cache Coherence Protocols on Multi-Core Systems with Linear Workloads
- Yong J. Jang and Won W. Ro
- In Proc. of 2009 International Colloquium on Computing, Communication, Control, and Management
- (CCCM 2009) Sanya, China, Aug. 8 - 9, 2009
- Comparing Open Source Web Services: gSoap and AXIS
- Jongwook Woo and Won W. Ro
- In Proc. of the 24th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2009) Jeju Island, Korea, July 5 - 8, 2009
- Efficient Parallelized Network Coding for P2P File Sharing Applications
- Karam Park, Joon-Sang Park, and Won W. Ro
- In Proc. of the 4th International Conference on Grid and Pervasive Computing
- (GPC 2009) Geneva, Switzerland, May 4 - 8, 2009
- Fully Pipelined Hardware Implementation of 128-bit SEED Block Cipher Algorithm
- Jaeyoung Yi, Karam Park, Joonseok Park, and Won W. Ro
- In Proc. of the 5th International Workshop on Applied Reconfigurable Computing
- (ARC 2009) Karlsruhe, Germany, Mar. 16 - 18, 2009
Book Chapters
- Programmability and Scalability on Multi-Core Architectures
- Jaeyoung Yi, Yong J. Jang, Doohwan Oh, and Won W. Ro
- Chapter in "Handbook of Research on Scalable Computing Technologies", edited by Kuan-Ching Li, Ching-Hsien Hsu, Laurence Tianruo Yang, Jack Dongarra, and Hans Zima, Information Science Reference, 2009
Journal Papers
- Efficient Peer-to-Peer File Sharing Using Network Coding in MANET
- Uichin Lee, Joon-Sang Park, Seung-Hoon Lee, Won W. Ro, Giovanni Pau, and Mario Gerla
- Journal of Communications and Networks, Vol. 10, No. 4, Dec. 2008 (IF: 0.223, Q4, JCR2007)
- A Low-Complexity Microprocessor Design with Speculative Pre-Execution
- Won W. Ro and Jean-Luc Gaudiot
- Journal of Systems Architecture, Vol. 54, No. 12, pp. 1101-1112, Dec. 2008 (IF: 0.490, Q3, JCR2007)
- Performance Evaluation of Programming Models for SMP-Based Clusters
- Myungho Lee, Neungsoo Park, Won W. Ro, and Kuan-Ching Li
- Journal of the Chinese Institute of Engineers, Vol. 31, No. 7, pp. 1181-1188, Dec. 2008 (IF: 0.183, Q4, JCR2007)
- Simultaneous Thin-Thread Processors for Low-Power Embedded Systems
- Won W. Ro, Jaeyoung Yi, Joon-Sang Park, and Joonseok Park
- IEICE Electronics Express, Vol. 5, No. 19, pp. 802-808, Oct. 2008 (IF: 0.436, Q3, JCR2007)
- Delay Analysis of Car-to-Car Reliable Data Delivery Strategies Based on Data Mulling with Network Coding
- Joon-Sang Park, Uichin Lee, Soon Young Oh, Mario Gerla, Desmond Siumen Lun, Won W. Ro, and Joonseok Park
- IEICE Transactions on Information and Systems, Vol. E91-D, No. 10, Oct. 2008 (IF: 0.245, Q4, JCR2007)
Conference Papers
- Parallel Algorithms for Steiner Tree Problem
- Joon-Sang Park, Won W. Ro, Handuck Lee, and Neungsoo Park
- In Proc. of the 3rd International Conference on Convergence and Hybrid Information Technology
- (ICHIT 2008)Busan, Korea, Nov. 11 - 13, 2008
Journal Papers
- Design and Evaluation of a Hierarchical Decoupled Architecture
- Won W. Ro, Stephen P. Crago, Alvin M. Despain, and Jean-Luc Gaudiot
- Journal of Supercomputing, Springer, Vol. 38, No. 3, pp. 237-259, Dec. 2006 (IF: 0.482, Q3, JCR2005)
- Speculative Pre-Execution Assisted by Compiler (SPEAR)
- Won W. Ro and Jean-Luc Gaudiot
- Journal of Parallel and Distributed Computing, Elsevier, Vol. 66, No. 8, pp. 1076-1089, Aug. 2006 (IF: 0.900, Q2, JCR2005)
Conference Papers
- Design and Effectiveness of Small-Sized Decoupled Dispatch Queues
- Won W. Ro and Jean-Luc Gaudiot
- In Proc. of European Conference on Parallel Computing - LNCS
- (EURO-PAR 2006) Dresden, Germany, Aug. 29 - Sep. 1, 2006
Conference Papers
- A Low-Complexity Issue Queue Design with Speculative Pre-Execution
- Won W. Ro and Jean-Luc Gaudiot
- In Proc. of the 12th International Conference on High Performance Computing
- (HiPC 2005) Goa, India, Dec. 18 - 21, 2005
Book Chapters
- Techniques to Improve Performance Beyond Pipelining: Superpipelining, Superscalar, and VLIW
- Jean-Luc Gaudiot, Jung-Yup Kang, and Won Woo Ro
- Chapter in "Computer Architecture", a volume of "Advances in Computers", edited by Ali R. Hurson, Elsevier, 2005
Conference Papers
- SPEAR: A Hybrid Model for Speculative Pre-Execution
- Won W. Ro and Jean-Luc Gaudiot
- In Proc. of the 18th International Parallel and Distributed Processing Symposium
- (IPDPS 2004) Santa Fe, New Mexico, 2004
Conference Papers
- HiDISC: A Decoupled Architecture for Data-Intensive Applications
- Won W. Ro, Jean-Luc Gaudiot, Stephen P. Crago, and Alvin M. Despain
- In Proc. of the 17th International Parallel and Distributed Processing Symposium
- (IPDPS 2003) Nice, France, Apr. 22 - 26, 2003
- Compiler Support for Dynamic Speculative Pre-Execution
- Won W. Ro and Jean-Luc Gaudiot
- In Proc. of the 7th Annual Workshop on Interaction between Compilers and Computer Architectures
- (INTERACT-7), in conjunction with HPCA-9, Anaheim, California, Feb. 8, 2003
Conference Papers
- Memory Latency: to Tolerate or to Reduce?
- Amol Bakshi, Jean-Luc Gaudiot, Wen-Yen Lin, Manil Makhija, Viktor K. Prasanna, Wonwoo Ro, and Chulho Shin
- In Proc. of the 12th Symposium on Computer Architecture and High Performance Computing
- (SBAC-PAD'00) Sao Pedro, Brazil, Oct. 24 - 27, 2000
- A High-Performance, Hierarchical Decoupled Architecture
- Stephen P. Crago, Alvin Despain, Jean-Luc Gaudiot, Manil Makhija, Wonwoo Ro, and Apoorv Srivastava
- In Proc. of the Memory access Decoupling for superscalar and multiple issue Architectures
- (MEDEA) Workshop, in conjunction with PACT 2000, Philadelphia, Oct. 15, 2000
- A Reliable Cluster Computing with a New Checkpointing RAID-x Architecture
- Kai Hwang, Hai Jin, Roy Ho, and Wonwoo Ro
- In Proc. of the 9th Heterogeneous Computing Workshop
- (HCW) Cancun, Mexico, May 1, 2000
All Publications
In Press
Journal Papers
- REC: Enhancing fine-grained cache coherence protocol in multi-GPU systems
- Gun Ko, Jiwon Lee, Hongju Kal, Hyunwuk Lee, and Won Woo Ro
- Journal of Systems Architecture, Vol. 160, Mar. 2025
- HashScape: Leveraging Virtual Address Dynamics for Efficient Hashed Page Tables
- Won Hur, Jiwon Lee, Jaewon Kwon, Minjae Kim, and Won Woo Ro
- IEEE Transactions on Computers, 2025
Conference Papers
- Ditto: Accelerating Diffusion Model via Temporal Value Similarity
- Sungbin Kim*, Hyunwuk Lee*, Wonho Cho, Mincheol Park, and Won Woo Ro
- The 31st IEEE International Symposium on High-Performance Computer Architecture
- (HPCA 2025)
- Marching Page Walks: Batching and Concurrent Page Table Walks for Enhancing GPU Throughput
- Jiwon Lee, Gun Ko, Myung Kuk Yoon, Ipoom Jeong, Yunho Oh, and Won Woo Ro
- The 31st IEEE International Symposium on High-Performance Computer Architecture
- (HPCA 2025)
- Qubit Movement-Optimized Program Generation on Zoned Neutral Atom Processors
- Enhyeok Jang, Youngmin Kim, Hyungseok Kim, Seungwoo Choi, Yipeng Huang, and Won Woo Ro
- The IEEE/ACM International Symposium on Code Generation and Optimization
- (CGO 2025)
- PIMutation: Exploring the Potential of Real PIM Architecture for Quantum Circuit Simulation
- Dongin Lee, Enhyeok Jang, Seungwoo Choi, Junwoong An, Cheolhwan Kim, and Won Woo Ro
- The 30th Asia and South Pacific Design Automation Conference
- (ASP-DAC 2025)
Journal Papers
- SHREG: Mitigating Register Redundancy in GPUs
- Seunghyun Jin, Hyunwuk Lee, Jonghyun Lee, Junsung Kim, and Won Woo Ro
- Journal of Systems Architecture, Vol. 152, July 2024
Conference Papers
- DEPrune: Depth-wise Separable Convolution Pruning for Maximizing GPU Parallelism
- Cheonjun Park, Mincheol Park, Hyunchan Moon, Myung Kuk Yoon, Seokjin Go, Suhyun Kim, and Won Woo Ro
- The 38th Annual Conference on Neural Information Processing Systems
- (NeurIPS 2024)
- AirGun: Adaptive Granularity Quantization for Accelerating Large Language Models
- Sungbin Kim, Hyunwuk Lee, Sungwoo Kim, Cheolhwan Kim, and Won Woo Ro
- The International Conference on Computer Design
- (ICCD 2024)
- MOSQ: Accelerating Classical Simulation of UCCSD Ansatz Circuits using Merged Operation
- Seungwoo Choi, Enhyeok Jang, Youngmin Kim, and Won Woo Ro
- The International Conference on Computer Design
- (ICCD 2024)
- Generalizing Ray Tracing Accelerators for Tree Traversals on GPUs
- Dongho Ha*, Lufei Liu*, Yuan Hsi Chou, Seokjin Go, Won Woo Ro, Hung-Wei Tseng, and Tor M. Aamodt
- The International Symposium on Microarchitecture
- (MICRO 2024)
- Barber: Balancing Thermal Relaxation Deviations of NISQ Programs by Exploiting Bit-Inverted Circuits
- Enhyeok Jang, Seungwoo Choi, Youngmin Kim, Jeewoo Seo, and Won Woo Ro
- The 2024 ACM/IEEE International Conference on Computer-Aided Design
- (ICCAD 2024)
- Recompiling QAOA Circuits on Various Rotational Directions
- Enhyeok Jang, Dongho Ha, Seungwoo Choi, Youngmin Kim, Jaewon Kwon, Yongju Lee, Sungwoo Ahn, Hyungseok Kim, and Won Woo Ro
- The 33rd International Conference on Parallel Architectures and Compilation Techniques
- (PACT 2024)
- M3XU: Achieving High-Precision and Complex Matrix Multiplication with Low-Precision MXUs
- Dongho Ha, Yunan Zhang, Chen-Chien Kao, Christopher J. Hughes, Won Woo Ro, and Hung-Wei Tseng
- The International Conference for High Performance Computing, Networking, Storage, and Analysis
- (SC 2024)
- GUMSO: Gating Unnecessary On-Chip Memory Slices for Power Optimization on GPUs
- Seunghyun Jin, Hyunwuk Lee, and Won Woo Ro
- The 2024 ACM/IEEE International Symposium on Low Power Electronics and Design
- (ISLPED 2024)
- Geneva: A Dynamic Confluence of Speculative Execution and In-Order Commitment Windows
- Yanghee Lee, Jiwon Lee, Jaewon Kwon, Yongju Lee, and Won Woo Ro
- The 61st Design Automation Conference
- (DAC 2024)
- Systolic Array Architecture Supporting Multiple Scaling Factors for U-Net Quantization
- Hyunwuk Lee and Won Woo Ro
- The 23rd International Conference on Electronics, Information, and Communication
- (ICEIC 2024)
- Evaluating Performance of Shared On-Chip Caches in Multi-GPUs
- Gun Ko and Won Woo Ro
- The 23rd International Conference on Electronics, Information, and Communication
- (ICEIC 2024)
- A Multi-DNN Acceleration Architecture for Balanced QoS and Throughput
- Ipoom Jeong, Sungji Choi, Minjae Kim, Enhyeok Jang, Seokjin Go, and Won Woo Ro
- The 23rd International Conference on Electronics, Information, and Communication
- (ICEIC 2024)
- Integrated Framework Design Methodologies to Support Processing-In-Memory Platforms
- Enhyeok Jang, Hongju Kal, Jaewon Kwon, and Won Woo Ro
- The 23rd International Conference on Electronics, Information, and Communication
- (ICEIC 2024)
- REPrune: Channel Pruning via Kernel Representative Selection
- Mincheol Park, Dongjin Kim, Cheonjun Park, Yuna Park, Gyeong Eun Gong, Won Woo Ro, and Suhyun Kim
- The 38th AAAI Conference on Artificial Intelligence
- (AAAI 2024)
Journal Papers
- A Convertible Neural Processor Supporting Adaptive Quantization for Real-Time Neural Networks
- Hongju Kal, Hyoseong Choi, Ipoom Jeong, Joon-Sung Yang, and Won Woo Ro
- Journal of Systems Architecture, Vol. 145, Nov. 2023
Conference Papers
- INTERPRET: Inter-Warp Register Reuse for GPU Tensor Cores
- Jae Seok Kwak, Myung Kuk Yoon, Ipoom Jeong, Seunghyun Jin, and Won Woo Ro
- The 32nd International Conference on Parallel Architectures and Compilation Techniques
- (PACT 2023)
- McCore: A Holistic Management of High-Performance Heterogeneous Multicores
- Jaewon Kwon, Yongju Lee, Hongju Kal, Minjae Kim, Youngsok Kim, and Won Woo Ro
- The 56th International Symposium on Microarchitecture
- (MICRO 2023)
- AESPA: Asynchronous Execution Scheme to Exploit Bank-Level Parallelism of Processing-in-Memory
- Hongju Kal, Chanyoung Yoo, and Won Woo Ro
- The 56th International Symposium on Microarchitecture
- (MICRO 2023)
- MAD MAcce: Supporting Multiply-Add Operations for Democratizing Matrix-Multiplication Accelerator
- Seunghwan Sung, Sujin Hur, Dongho Ha, Sungwoo Kim, Yunho Oh, and Won Woo Ro
- The 56th International Symposium on Microarchitecture
- (MICRO 2023)
- Exploiting Inherent Properties of Complex Numbers for Accelerating Complex Valued Neural Networks
- Hyunwuk Lee, Hyungjun Jang, Sungbin Kim, Sungwoo Kim, Wonho Cho, and Won Woo Ro
- The 56th International Symposium on Microarchitecture
- (MICRO 2023)
- Performance Analysis of Criticality-Aware Out-of-Order Cores for Exploiting MLP
- Yanghee Lee, Jiwon Lee, and Won Woo Ro
- The 38th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2023)
- Adaptive Data Prefetcher with Probability Learning in LLC
- Jusin Kim, Jiwon Lee, and Won Woo Ro
- The 38th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2023)
- Context Swap: Multi-PIM System Preventing Remote Memory Access for Large Embedding Model Acceleration
- Hongju Kal, Cheolhwan Kim, Minjae Kim, and Won Woo Ro
- The 2023 IEEE International Conference on Artificial Intelligence Circuits and Systems
- (AICAS 2023)
- TensorCV: Accelerating Non-AI/ML Stages in Computing Vision Pipelines using Tensor Processors
- Dongho Ha, Won Woo Ro, and Hung-Wei Tseng
- The 2023 ACM/IEEE International Symposium on Low Power Electronics and Design
- (ISLPED 2023)
- R2D2: Removing ReDunDancy Utilizing Linearity of Address Generation in GPUs
- Dongho Ha, Yunho Oh, and Won Woo Ro
- The 50th International Symposium on Computer Architecture
- (ISCA 2023)
- Early-Adaptor: An Adaptive Framework for Proactive UVM Memory Management
- Seokjin Go, Hyunwuk Lee, Junsung Kim, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
- The 2023 IEEE International Symposium on Performance Analysis of Systems and Software
- (ISPASS 2023)
- Lightning Talk: Efficiency and Programmability of DNN Accelerators and GPUs
- Won Woo Ro
- The 60th ACM/IEEE Design Automation Conference
- (DAC 2023)
- Quixote: Improving Fidelity of Quantum Program by Independent Execution of Controlled Gates
- Enhyeok Jang, Seungwoo Choi, and Won Woo Ro
- The 60th ACM/IEEE Design Automation Conference
- (DAC 2023)
- Balanced Column-Wise Block Pruning for Maximizing GPU Parallelism
- Cheonjun Park, Mincheol Park, Hyun Jae Oh, Minkyu Kim, Myung Kuk Yoon, Suhyun Kim, and Won Woo Ro
- The 37th AAAI Conference on Artificial Intelligence
- (AAAI 2023)
- SnakeByte: A TLB Design with Adaptive and Recursive Page Merging in GPUs
- Jiwon Lee, Ju Min Lee, Yunho Oh, William J. Song, and Won Woo Ro
- The 29th IEEE International Symposium on High-Performance Computer Architecture
- (HPCA 2023)
- Analysis on Memory Access Patterns of Server-Class Workloads in Page- and Cache Line- Granularity
- Kyeonghoon Lim, Minjae Kim, Jiwon Lee, and Won Woo Ro
- The 22nd International Conference on Electronics, Information, and Communication
- (ICEIC 2023)
- Enabling Heterogeneous Memory System over CXL
- Dongin Lee, Sungbin Kim, Hyungjun Jang, Sungwoo Kim, and Won Woo Ro
- The 22nd International Conference on Electronics, Information, and Communication
- (ICEIC 2023)
- Investigation on NVIDIA Ampere GPU Architecture with Reverse Engineering
- Sujin Hur, Seunghwan Sung, Dongho Ha, Sungwoo Kim, and Won Woo Ro
- The 22nd International Conference on Electronics, Information, and Communication
- (ICEIC 2023)
Journal Papers
- TEA-RC: Thread Context-Aware Register Cache for GPUs
- Ipoom Jeong, Yunho Oh, Won Woo Ro, and Myung Kuk Yoon
- Accepted to IEEE Access
- CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs
- Yunho Oh, Ipoom Jeong, Won Woo Ro, and Myung Kuk Yoon
- Accepted to IEEE Embedded Systems Letters
- FLIXR: Embedding Index into Flash Translation Layer in SSDs
- Gunjae Koo, Yunho Oh, Hung-Wei Tseng, Won Woo Ro, and Murali Annavaram
- Accepted to IEEE Transactions on Computers
Conference Papers
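- Implementation of a BLAS-Based Framework for Utilizing Heterogeneous Processing-in-Memory Architectures
- Chanyoung Yoo, Enhyeok Jang, Hongju Kal, and Won Woo Ro
- Fall Conference of the Institute of Electronics and Information Engineers (IEIE)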
- Reconstructing Out-of-Order Issue Queue
- Ipoom Jeong, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
- The 55th IEEE/ACM International Symposium on Microarchitecture
- (MICRO 2022)
- Analysis of SSD with Logical to Physical Address Mapping of Hot Data to Single Level Cell Area
- Gyuseok Choe, Youngmin Lee, and Won Woo Ro
- The 37th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2022)
- Analysis of DRAM-based Network of DRAM Swap Space Adopting Latency Hiding Technique
- Hyoseong Choi, Jiwon Lee, Jeonghoon Choi, and Won Woo Ro
- The 37th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2022)
- PR3D: Processing Recommendation Systems in 3D-Stacked DRAM Adopting Heterogeneous Data Format
- Chanyoung Yoo, Hongju Kal, and Won Woo Ro
- The 21st International Conference on Electronics, Information, and Communication
- (ICEIC 2022)
Journal Papers
- Two-Stage In-Storage Processing and Scheduling for Pattern Matching Applications
- Joohyeong Yoon, Yoonjin Lee, Won Seob Jeong, and Won Woo Ro
- IEEE Access, Vol. 9, pp. 95702-95715, Jun. 2021
- PIMCaffe: Functional Evaluation of a Machine Learning Framework for In-Memory Neural Processing Unit
- Won Jeon, Jiwon Lee, Dongseok Kang, Hongju Kal, and Won Woo Ro
- IEEE Access, Vol. 9, pp. 96629-96640, Jul. 2021
Conference Papers
- SPACE: Locality-Aware Processing in Heterogeneous Memory for Personalized Recommendations
- Hongju Kal, Seokmin Lee, Gun Ko, and Won Woo Ro
- The 48th ACM/IEEE International Symposium on Computer Architecture
- (ISCA 2021)
- Analysis of GPU Scheduling Technique for Convergence Barrier
- Jae Seok Kwak and Won Woo Ro
- The 20th International Conference on Electronics, Information, and Communication
- (ICEIC 2021)
- Delay Analysis on Tensor Access Patterns of CNN Algorithms
- Jonathan Robert Malin and Won Woo Ro
- The 20th International Conference on Electronics, Information, and Communication
- (ICEIC 2021)
- Detecting Pattern of Warp Register Value Differences in CTA using GPU Compiler
- Dongho Ha and Won Woo Ro
- The 20th International Conference on Electronics, Information, and Communication
- (ICEIC 2021)
- Analysis of Multiple-Application Support Techniques in GPU
- Jonghyun Lee and Won Woo Ro
- The 6th International Conference On Consumer Electronics (ICCE) Asia
- (ICCE-ASIA 2021)
- Analysis of Key-Value SSD to Improve the Performance of Key-Value Store System
- Gyuseok Choe, Jeonghoon Choi, and Won Woo Ro
- The 6th International Conference On Consumer Electronics (ICCE) Asia
- (ICCE-ASIA 2021)
- QoS-Aware Scheduling for Cellular Networks Using Deep Reinforcement Learning
- Jonathan Robert Malin, Gun Ko, and Won Woo Ro
- The 18th IFIP International Conference on Network and Parallel Computing
- (NPC 2021)
Journal Papers
- Hi-End: Hierarchical, Endurance-Aware STT-MRAM-Based Register File for Energy-Efficient GPUs
- Won Jeon, Jun Hyun Park, Yoonsoo Kim, Gunjae Koo, and Won Woo Ro
- IEEE Access, Vol. 8, pp. 127768-127780, Jul. 2020
- REACT: Scalable and High-Performance Regular Expression Pattern Matching Accelerator for In-Storage Processing
- Won Seob Jeong, Changmin Lee, Keunsoo Kim, Myung Kuk Yoon, Won Jeon, Myoungsoo Jung, and Won Woo Ro
- IEEE Transactions on Parallel and Distributed Systems, Vol. 31, Issue 5, pp. 1137-1151, May 2020
Conference Papers
- BENEFIT: Basic Linear Algebra Subprogram and Neural Network framework for FPGA-based Neural Processing Units
- Dongseok Kang and Won Woo Ro
- The Fifth International Conference On Consumer Electronics Asia
- (ICCE-ASIA 2020)
- Busan, Korea, Nov. 1 - 3, 2020
- OASIS: Overhead Analysis of Systolic Neural Processing Unit on LSTM
- Byunghwy Choi and Won Woo Ro
- The Fifth International Conference On Consumer Electronics Asia
- (ICCE-ASIA 2020)
- Busan, Korea, Nov. 1 - 3, 2020
- Interaction Data Analysis for Personalized Recommendation System
- Seokmin Lee and Won Woo Ro
- The Fifth International Conference On Consumer Electronics Asia
- (ICCE-ASIA 2020)
- Busan, Korea, Nov. 1 - 3, 2020
- BODCA: Heterogeneous CPU-GPU computing system with Bandwidth-Optimized DRAM cache design
- Sungji Choi and Won Woo Ro
- The Fifth International Conference On Consumer Electronics Asia
- (ICCE-ASIA 2020)
- Busan, Korea, Nov. 1 - 3, 2020
- Duplo: Lifting Redundant Memory Accesses of Neural Networks for GPU Tensor Cores
- Hyeonjin Kim, Sungwoo Ahn, Yunho Oh, Bogil Kim, Won Woo Ro, and William J. Song
- The 53rd IEEE/ACM International Symposium on Microarchitecture
- (MICRO 2020)
- Virtual Conference, Oct. 17 - 21, 2020
- Check-In: In-Storage Checkpointing for Key-Value Store System Leveraging Flash-Based SSDs
- Joohyeong Yoon, Won Seob Jeong, and Won Woo Ro
- The 47th ACM/IEEE International Symposium on Computer Architecture
- (ISCA 2020)
- Virtual Conference, May 29 - Jun. 3, 2020
- CASINO Core Microarchitecture: Generating Out-of-Order Schedules Using Cascaded In-Order Scheduling Windows
- Ipoom Jeong, Seihoon Park, Changmin Lee, and Won Woo Ro
- The 26th International IEEE Symposium on High Performance Computer Architecture
- (HPCA 2020)
- San Diego, CA, USA, Feb. 22 - 26, 2020
- Self-controllable refresh target row skip and inclusion technique for the intelligent DRAM
- Jaein Song and Won Woo Ro
- The 19th International Conference on Electronics, Information and Communication
- (ICEIC 2020)
- Access Characteristic-based Cache Replacement Policy in an SSD
- Joohyeong Yoon and Won Woo Ro
- The 19th International Conference on Electronics, Information and Communication
- (ICEIC 2020)
Journal Papers
- OverCome: Coarse-Grained Instruction Commit with Handover Register Renaming
- Ipoom Jeong, Changmin Lee, Keunsoo Kim, and Won Woo Ro
- IEEE Transactions on Computers, Vol. 68, Issue 12, pp. 1802-1816, Dec. 2019
- Contents-Aware Partitioning Algorithm for Parallel High Efficiency Video Coding
- Kyungah Kim and Won Woo Ro
- Multimedia Tools and Applications, Vol. 78, Issue 9, pp. 11427-11442, May 2019
- Fast CU Depth Decision for HEVC using Neural Networks
- Kyungah Kim and Won Woo Ro
- IEEE Transactions on Circuits and Systems for Video Technology, Vol. 29, No. 5, pp. 1462-1473, May 2019
- Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs
- Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Murali Annavaram, and Won Woo Ro
- IEEE Transactions on Computers, Vol. 68, No. 4, pp. 609-616, Apr. 2019
Conference Papers
- Efficient Dilated-Winograd Convolutional Neural Networks
- Minsik Kim, Cheonjun Park, Sungjun Kim, Taeyoung Hong, and Won Woo Ro
- The 2019 IEEE International Conference on Image Processing, Accepted
- Performance Scalability Limit of PARSEC Benchmark on a Many-Core Processor
- Won Seob Jeong and Won Woo Ro
- The 34th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2019)
- Jeju, Korea, Jun. 23 - 26, 2019
- Analysis of SSD Internal DRAM Sensitivity for a Key-Value Store
- Yongseok Won, Yoonjin Lee, Won Seob Jeong, and Won Woo Ro
- The 34th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2019)
- Jeju, Korea, Jun. 23 - 26, 2019
- Exploiting GPU hierarchical TLB in Multi-Application Execution
- Hyun Jae Oh, Won Jeon, and Won Woo Ro
- The 34th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2019)
- Jeju, Korea, Jun. 23 - 26, 2019
- Hierarchical, Compressed STT-MRAM Register File for GPU
- Jun Hyun Park and Won Woo Ro
- The 34th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2019)
- Jeju, Korea, Jun. 23 - 26, 2019
- Linebacker: Preserving Victim Cache Lines in Idle Register Files of GPUs
- Yunho Oh, Gunjae Koo, Murali Annavaram, and Won Woo Ro
- The 46th ACM/IEEE International Symposium on Computer Architecture
- (ISCA 2019)
- Phoenix, Arizona, USA, Jun. 22 - 26, 2019
- Analysis of SSD Internal Cache Problem in a Key-Value Store System
- Won Seob Jeong, Yongseok Won, and Won Woo Ro
- The 2nd International Conference on Big Data and Smart Computing
- (ICBDSC 2019)
- Bali, Indonesia, Jan. 10 - 13, 2019
Journal Papers
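- Trends in the Development of High-Performance Graphics Processing Units
- Dongho Ha, Hyunwuk Lee, Jiwon Lee, Hyun Jae Oh, Won Jeon, Yunho Oh, and Won Woo Ro
- Communications of the Korean Institute of Information Scientists and Engineers (KIISE)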
- WASP: Selective Data Prefetching with Monitoring Runtime Warp Progress on GPUs
- Yunho Oh, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, and Won Woo Ro
- IEEE Transactions on Computers, Vol. 67, No. 9, pp. 1366-1373, Sep. 2018
- Exploiting Pseudo-Quadtree Structure for Accelerating HEVC Spatial Resolution Downscaling Transcoder
- Minsik Kim, Minyong Sung, Minwoo Kim, and Won Woo Ro
- IEEE Transactions on Multimedia, Vol. 20, No. 9, pp. 2262-2275, Sep. 2018
- Architectural Protection of Application Privacy against Software and Physical Attacks in Untrusted Cloud Environment
- Lei Xu, JongHyuk Lee, Seung Hun Kim, Qingji Zheng, Shouhuai Xu, Taeweon Suh, Won Woo Ro, and Weidong Shi
- IEEE Transactions on Cloud Computing, Vol. 6, No. 2, pp. 478-491, Apr. - Jun. 2018
- Simultaneous and Speculative Thread Migration for Improving Energy Efficiency of Heterogeneous Core Architectures
- Changmin Lee and Won Woo Ro
- IEEE Transactions on Computers, Vol. 67, No. 4, pp. 498-512, Apr. 2018
Conference Papers
- Region of Interest based Frame Rate Up-Conversion using Encoded Bit-stream
- Kyungah Kim and Won Woo Ro
- International Conference on Communication, Image and Signal Processing
- (CCISP 2018)
- Sanya, China, Nov. 16 - 18, 2018
- FineReg: Fine-Grained Register File Management for Augmenting GPU Throughput
- Yunho Oh, Myung Kuk Yoon, William J. Song, and Won Woo Ro
- The 51st IEEE/ACM International Symposium on Microarchitecture
- (MICRO 2018)
- Fukuoka, Japan, Oct. 20 - 24, 2018
- Fast Intra LCU Decision using Deep Neural Networks
- Kyungah Kim and Won Woo Ro
- The International Conference On Big data, IoT, and Cloud Computing
- (BIC-18)
- Jeju, Korea, Aug. 20 - 22, 2018
- Near-Data Processing Optimization for Efficient Neural Network Computations
- Sungwoo Ahn, Won Jeon, and Won Woo Ro
- The 3rd International Conference On Consumer Electronics Asia
- (ICCE-ASIA 2018)
- Jeju, Korea, Jun. 24 - 26, 2018
- Constructing Resilient Region in Dynamic Optimization Systems via Dynamic Adjustment of Bias Thresholds
- Ipoom Jeong and Won Woo Ro
- The 3rd International Conference On Consumer Electronics Asia
- (ICCE-ASIA 2018)
- Jeju, Korea, Jun. 24 - 26, 2018
- WIR: Warp Instruction Reuse to Minimize Repeated Computations in GPUs
- Keunsoo Kim and Won Woo Ro
- The 24th International IEEE Symposium on High Performance Computer Architecture
- (HPCA 2018)
- Vienna, Austria, Feb. 24 - 28, 2018
- Efficient and Reliable NAND Flash Channel for High-Speed Solid State Drives
- Joohyeong Yoon, Won Seob Jeong, Won Jeon, and Won Woo Ro
- The 17th International Conference on Electronics, Information and Communication
- (ICEIC 2018)
- Honolulu, HI, USA, Jan. 24 - 27, 2018
- Fast Robot Software Framework with Object-Oriented Design
- Heekuk Lee, Keunsoo Kim, and Won Woo Ro
- The 17th International Conference on Electronics, Information and Communication
- (ICEIC 2018)
- Honolulu, HI, USA, Jan. 24 - 27, 2018
Journal Papers
- Dynamic Resizing on Active Warps Scheduler to Hide Operation Stalls on GPUs
- Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
- IEEE Transactions on Parallel and Distributed Systems, Vol. 28, No. 11, pp. 3142-3156, Nov. 2017
- Dynamic Load Balancing of Dispatch Scheduling for Solid State Disks
- Myunghyun Jo and Won Woo Ro
- IEEE Transactions on Computers, Vol. 66, No. 6, pp. 1034-1047, Jun. 2017
- Improving Energy Efficiency of GPUs through Data Compression and Compressed Execution
- Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Murali Annavaram, and Won Woo Ro
- IEEE Transactions on Computers, Vol. 66, No. 5, pp. 834-847, May 2017
Conference Papers
- Parallel In-Order Execution Architecture for Low-Power Processor
- Kyungmin Lee, Ipoom Jeong, and Won Woo Ro
- The 14th International SoC Design Conference
- (ISOCC 2017)
- Seoul, Korea, Nov. 5 - 8, 2017
- Characterizing Convolutional Neural Network Workloads on a Detailed GPU Simulator
- Kwanghee Chang, Minsik Kim, Kyungah Kim, and Won Woo Ro
- The 14th International SoC Design Conference
- (ISOCC 2017)
- Seoul, Korea, Nov. 5 - 8, 2017
- Access Pattern-Aware Cache Management for Improving Data Utilization in GPU
- Gunjae Koo, Yunho Oh, Won Woo Ro, and Murali Annavaram
- The 44th ACM/IEEE International Symposium on Computer Architecture
- (ISCA 2017)
- Toronto, Canada, Jun. 24 - 28, 2017
- Dynamic Warp Scheduler Selection Policy Using Linear Regression for GPUs
- Hyunjune Shin, Kyungmin Lee, Ipoom Jeong, Jong Hyun Park, and Won Woo Ro
- The 16th International Conference on Electronics, Information and Communication
- (ICEIC 2017)
- Phuket, Thailand, Jan. 11 - 14, 2017
- Exploiting L2 Cache Sensitivity in Artificial Neural Network on GPUs
- Seihoon Park, Yoonsoo Kim, Minsik Kim, and Won Woo Ro
- The 16th International Conference on Electronics, Information and Communication
- (ICEIC 2017)
- Phuket, Thailand, Jan. 11 - 14, 2017
- Optimizing Intersection and Reflection Step of Geometrical Optics using GPUs
- Hyun Jin Chung, Myung Kuk Yoon, and Won Woo Ro
- The 16th International Conference on Electronics, Information and Communication
- (ICEIC 2017)
- Phuket, Thailand, Jan. 11 - 14, 2017
- Analysis of Error Tolerance in Convolution Neural Networks
- Sangheon Kwon, Jong Hyun Park, and Won Woo Ro
- The 16th International Conference on Electronics, Information and Communication
- (ICEIC 2017)
- Phuket, Thailand, Jan. 11 - 14, 2017
Journal Papers
- Server Side, Play Buffer Based Quality Control for Adaptive Media Streaming
- Keunsoo Kim, Benjamin Y. Cho, and Won Woo Ro
- Multimedia Tools and Applications, Vol. 75, No. 10, pp. 5397-5415, May 2016
- Exploiting Thread-Level Parallelism on HEVC by Employing Reference Dependency Graph
- Minwoo Kim, Deokho Kim, Kyungah Kim, and Won Woo Ro
- IEEE Transactions on Circuits and Systems for Video Technology, Vol. 26, No. 4, pp. 736-749, Apr. 2016
- Parallel GPU Architecture Simulation Framework Exploiting Architectural-Level Parallelism with Timing Error Prediction
- Sangpil Lee and Won Woo Ro
- IEEE Transactions on Computers, Vol. 65, No. 4, pp. 1253-1265, Apr. 2016
Conference Papers
- Measuring Error-Tolerance in SRAM Architecture on Hardware Accelerated Neural Network
- Sangheon Kwon, Kyungmin Lee, Yoonsoo Kim, Kyungah Kim, Changmin Lee, and Won Woo Ro
- The 1st IEEE International Conference on Consumer Electronics Asia
- (ICCE-ASIA 2016)
- Seoul, Korea, Oct. 26 - 28, 2016
- Virtual Thread: Maximizing Thread-Level Parallelism beyond GPU Scheduling Limit
- Myung Kuk Yoon, Keunsoo Kim, Sangpil Lee, Won Woo Ro, and Murali Annavaram
- The 43rd ACM/IEEE International Symposium on Computer Architecture
- (ISCA 2016)
- Seoul, Korea, Jun. 18 - 22, 2016
- APRES: Improving Cache Efficiency by Exploiting Load Characteristics on GPUs
- Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Won Woo Ro, and Murali Annavaram
- The 43rd ACM/IEEE International Symposium on Computer Architecture
- (ISCA 2016)
- Seoul, Korea, Jun. 18 - 22, 2016
- Warped-Slicer: Efficient Intra-SM Slicing through Dynamic Resource Partitioning for GPU Multiprogramming
- Qiumin Xu, Hyeran Jeon, Keunsoo Kim, Won Woo Ro, and Murali Annavaram
- The 43rd ACM/IEEE International Symposium on Computer Architecture
- (ISCA 2016)
- Seoul, Korea, Jun. 18 - 22, 2016
- Warped-Preexecution: A GPU Pre-execution Approach for Improving Latency Hiding
- Keunsoo Kim, Sangpil Lee, Myung Kuk Yoon, Gunjae Koo, Won Woo Ro, and Murali Annavaram
- The 22nd International IEEE Symposium on High Performance Computer Architecture
- (HPCA 2016)
- Barcelona, Spain, Mar. 12 - 16, 2016
- Accelerating Forwarding Computation of ANN using CUDA
- Jong Hyun Park and Won Woo Ro
- The 15th International Conference on Electronics, Information and Communication
- (ICEIC 2016)
- Danang, Vietnam, Jan. 27 - 30, 2016
- Fairness-Aware Thread Scheduling for Multithreaded Program using Intel Software Guarded Extensions
- Won Jeon, Seung Hun Kim, and Won Woo Ro
- The 15th International Conference on Electronics, Information and Communication
- (ICEIC 2016)
- Danang, Vietnam, Jan. 27 - 30, 2016
Journal Papers
- A Performance-Energy Model to Evaluate Single Thread Execution Acceleration
- Seung Hun Kim, Dohoon Kim, Changmin Lee, Won Seob Jeong, Won Woo Ro, and Jean-Luc Gaudiot
- IEEE Computer Architecture Letters, Vol. 14, No. 2, pp. 99-102, Dec. 2015
- Dynamic Load Balancing of Parallel SURF with Vertical Partitioning
- Deokho Kim, Minwoo Kim, Kyungah Kim, Minyong Sung, and Won Woo Ro
- IEEE Transactions on Parallel and Distributed Systems, Vol. 26, No. 12, pp. 3358-3370, Dec. 2015
- Network Variation and Fault Tolerant Performance Acceleration in Mobile Devices with Simultaneous Remote Execution
- Keunsoo Kim, Benjamin Y. Cho, Won Woo Ro, and Jean-Luc Gaudiot
- IEEE Transactions on Computers, Vol. 64, No. 10, pp. 2862-2874, Oct. 2015
- Highly Secure Mobile Devices Assisted with Trusted Cloud Computing Environments
- Doohwan Oh, Ilkyu Kim, Keunsoo Kim, Sang-Min Lee, and Won Woo Ro
- ETRI Journal, Vol. 37, No. 2, pp. 348-358, Apr. 2015
Conference Papers
- True Motion Compensation With Feature Detection for Frame Rate Up-Conversion
- Kyungah Kim, Minwoo Kim, Deokho Kim, and Won Woo Ro
- The 2015 IEEE International Conference on Image Processing
- (ICIP 2015)
- Quebec City, Canada, Sep. 27 - 30, 2015
- An Accelerated Separable Median Filter with Sorting Networks
- Minsik Kim, Deokho Kim, Minyong Sung, and Won Woo Ro
- The 2015 IEEE International Conference on Image Processing
- (ICIP 2015)
- Quebec City, Canada, Sep. 27 - 30, 2015
- Contention-Free Fair Queuing for High-Speed Storage with RAID-0 Architecture
- Myung Hyun Jo and Won Woo Ro
- The 17th IEEE International Conference on High Performance Computing and Communications
- (HPCC 2015)
- New York, USA, Aug. 24 - 26, 2015
- Integrity Protection for Big Data Processing with Dynamic Redundancy Computation
- Zhimin Gao, Nicholas DeSalvo, Pham Dang Khoa, Seung Hun Kim, Lei Xu, Won Woo Ro, Rakesh M. Verma, and Weidong Shi
- The 2015 IEEE International Conference on Autonomic Computing
- (ICAC 2015)
- Grenoble, France, July 7 - 10, 2015
- Improving Pipeline Utilization with Two-Level Instruction Issue on GPUs
- Yunho Oh, Jong Hyun Park, and Won Woo Ro
- The 30th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2015)
- Seoul, Korea, Jun. 29 - July 2, 2015
- Accelerating ELMs on the GPU Toward Real-Time Training on Large Scale Data Sets
- Han Kyul Kim, Jong Hyun Park, and Won Woo Ro
- The 30th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2015)
- Seoul, Korea, Jun. 29 - July 2, 2015
- A Frequency Scaling Model for Energy Efficient DVFS Designs based on Circuit Delay Optimization
- Ki Bum Chun, Changmin Lee, and Won Woo Ro
- The 19th IEEE International Symposium on Consumer Electronics
- (ISCE 2015)
- UPM, Madrid, Spain, Jun. 24 - 26, 2015
- Another Look at Secure Big Data Processing: a Formal Framework and a Practical Approach
- Lei Xu, Seung Hun Kim, Won Woo Ro, and Weidong Shi
- The 8th IEEE International Conference on Cloud Computing
- (Cloud'15, Application Track)
- New York, USA, Jun. 27 - July 2, 2015
- Enhancing Software Dependability and Security with Hardware Supported Instruction Address Space Randomization
- Seung Hun Kim, Lei Xu, Ziyi Liu, Zhiqiang Lin, Won Woo Ro, and Weidong Shi
- The 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks
- (DSN 2015)
- Rio de Janeiro, Brazil, Jun. 22 - 25, 2015
- Warped-Compression: Enabling Power Efficient GPUs through Register Compression
- Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Won Woo Ro, and Murali Annavaram
- The 42nd ACM/IEEE International Symposium on Computer Architecture
- (ISCA 2015)
- Portland, OR, USA, Jun. 13 - 17, 2015
- DRAW: Investigating Benefits of Adaptive Fetch Group Size on GPU
- Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
- The 2015 IEEE International Symposium on Performance Analysis of Systems and Software
- (ISPASS 2015)
- Philadelphia, PA, USA, Mar. 29 - 31, 2015
Journal Papers
- A Malicious Pattern Detection Engine for Embedded Security Systems in Internet of Things
- Doohwan Oh, Deokho Kim, and Won Woo Ro
- Sensors, Vol. 14, No. 12, pp. 24188-24211, Dec. 2014
- C-Lock: Energy Efficient Synchronization for Embedded Multicore Systems
- Seung Hun Kim, Sang Hyong Lee, Minje Jun, Byunghoon Lee, Won Woo Ro, Eui-Young Chung, and Jean-Luc Gaudiot
- IEEE Transactions on Computers, Vol. 63, No. 8, pp. 1962-1974, Aug. 2014
- Swarm Processor System: Hardware Process Scheduler based Energy Efficient Multi-Core System
- Won Seob Jeong, Seung Hun Kim, Sang-Min Lee, and Won Woo Ro
- IEICE Electronics Express, Vol. 11, No. 14, pp. 20140424, July 2014
- Complexity-Effective Contention Management with Dynamic Backoff for Transactional Memory Systems
- Seung Hun Kim, Dongmin Choi, Won Woo Ro, and Jean-Luc Gaudiot
- IEEE Transactions on Computers, Vol. 63, No. 7, pp. 1696-1708, July 2014
- Architectural Investigation of Matrix Data Layout on Multicore Processors
- Minwoo Kim and Won Woo Ro
- Future Generation Computer Systems, Vol. 37, pp. 64-75, July 2014
- Exploiting Implementation Diversity and Partial Connection of Routers in Application-Specific Network-on-Chip Topology Synthesis
- Minje Jun, Won Woo Ro, and Eui-Young Chung
- IEEE Transactions on Computers, Vol. 63, No. 6, pp. 1434-1445, Jun. 2014
- Accelerating MapReduce Framework on Multi-GPU Systems
- Hai Jiang, Yi Chen, Zhi Qiao, Kuan-Ching Li, Won Woo Ro, and Jean-Luc Gaudiot
- Cluster Computing, Vol. 17, No. 2, pp. 293-301, Jun. 2014
- Boosting CUDA Applications with CPU-GPU Hybrid Computing
- Changmin Lee, Won Woo Ro, and Jean-Luc Gaudiot
- International Journal of Parallel Programming, Vol. 42, No. 2, pp. 384-404, Apr. 2014
- This is an extension of our INTERACT-16 paper, which was selected as one of the best papers and recommended to IJPP.
Conference Papers
- LUT based Secure Cloud Computing - an Implementation using FPGAs
- Lei Xu, Pham Dang Khoa, Seung Hun Kim, Won Woo Ro, and Weidong Shi
- 2014 International Conference on ReConFigurable Computing and FPGAs
- (ReConFig 2014)
- Cancun, Mexico, Dec. 7 - 10, 2014
- Workload Synthesis: Generating Benchmark Workloads from Statistical Execution Profile
- Keunsoo Kim, Changmin Lee, Jung Ho Jung, and Won Woo Ro
- IEEE International Symposium on Workload Characterization
- (IISWC 2014)
- Raleigh, North Carolina, USA, Oct. 26 - 28, 2014
- Accelerating Gesture Recognition Algorithm Using Coarse Grained Reconfigurable Architectures
- Minsik Kim, Deokho Kim, Minyong Sung, Wonjae Lee, Jaehyun Kim, and Won Woo Ro
- The 4th International Conference on Audio, Language and Image Processing
- (ICALIP 2014)
- Shanghai, China, July 7 - 9, 2014
- A Micro-benchmark Suite to Understand Micro-Architectural Differences between Processors
- Changmin Lee, Keunsoo Kim, Jung Ho Jung, and Won Woo Ro
- The 29th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2014)
- Phuket, Thailand, July 1 - 4, 2014
- Maximizing DRAM Performance using Selective Operating Frequency Boosting
- Jung Ho Jung, Seung Hun Kim, Changmin Lee, and Won Woo Ro
- The 18th International Symposium on Consumer Electronics
- (ISCE 2014)
- Jeju, Korea, Jun. 22 - 25, 2014
- Workload and Variation Aware Thread Scheduling for Heterogeneous Multi-processor
- Seungwon Lee and Won Woo Ro
- The 18th International Symposium on Consumer Electronics
- (ISCE 2014)
- Jeju, Korea, Jun. 22 - 25, 2014
- Best paper award, Bronze prize
- DPM: Data Partitioning Method for Pipelined MapReduce on GPU
- Myung Hyun Jo and Won Woo Ro
- The 18th International Symposium on Consumer Electronics
- (ISCE 2014)
- Jeju, Korea, Jun. 22 - 25, 2014
- Accelerating HEVC Transcoder by Exploiting Decoded Quadtree
- Minyong Sung, Minwoo Kim, Minsik Kim, and Won Woo Ro
- The 18th International Symposium on Consumer Electronics
- (ISCE 2014)
- Jeju, Korea, Jun. 22 - 25, 2014
- Multicore Speedup Models using Frequency Scaling with Fixed Power Budget
- Seungwon Lee, Seung Hun Kim, and Won Woo Ro
- The 13th International Conference on Electronics, Information and Communication
- (ICEIC 2014)
- Kota Kinabalu, Malaysia, Jan. 15 - 18, 2014
- Hyper Threading-aware Virtual Machine Migration
- Chungmu Oh and Won Woo Ro
- The 13th International Conference on Electronics, Information and Communication
- (ICEIC 2014)
- Kota Kinabalu, Malaysia, Jan. 15 - 18, 2014
- Development of Efficient VCPU Pinning Mechanism in Xen
- Kyung Yoon Min, Seung Hun Kim, and Won Woo Ro
- The 13th International Conference on Electronics, Information and Communication
- (ICEIC 2014)
- Kota Kinabalu, Malaysia, Jan. 15 - 18, 2014
Journal Papers
- Parallelized Sub-Resource Loading for Web Rendering Engine
- Deokho Kim, Changmin Lee, Sangpil Lee, and Won Woo Ro
- Journal of Systems Architecture, Vol. 59, No. 9, pp. 785-793, Oct. 2013
- Design and Evaluation of Random Linear Network Coding Accelerators on FPGAs
- Sunwoo Kim, Won Seob Jeong, Won Woo Ro, and Jean-Luc Gaudiot
- ACM Transactions on Embedded Computing Systems, Vol. 13, No. 1, pp. 1-24, Aug. 2013
- GPU-Friendly Parallel Genome Matching with Tiled Access and Reduced State Transition Table
- Yunho Oh, Doohwan Oh, and Won Woo Ro
- International Journal of Parallel Programming, Vol. 41, No. 4, pp. 526-551, Aug. 2013
- A Distributed Signature Detection Method for Detecting Intrusions in Sensor Systems
- Ilkyu Kim, Doohwan Oh, Myung Kuk Yoon, Kyueun Yi, and Won Woo Ro
- Sensors, Vol. 13, No. 4, pp. 3998-4016, Mar. 2013
- Exploiting SIMD Parallelism on Dynamically Partitioned Parallel Network Coding for P2P Systems
- Deokho Kim, Karam Park, and Won Woo Ro
- Computers & Electrical Engineering, Vol. 39, No. 1, pp. 55-56, Jan. 2013
- Benefits of Using Parallelized Non-Progressive Network Coding
- Minwoo Kim, Karam Park, and Won Woo Ro
- Journal of Network and Computer Applications, Vol. 36, No. 1, pp. 293-305, Jan. 2013
- Importance of Coherence Protocols with Network Applications on Multi-Core Processors
- Kyueun Yi, Won Woo Ro, and Jean-Luc Gaudiot
- IEEE Transactions on Computers, Vol. 62, No. 1, pp. 6-15, Jan. 2013
Conference Papers
- Efficient Descriptor-Filtering Algorithm for Speeded Up Robust Features Matching
- Minwoo Kim, Deokho Kim, Kyungah Kim, and Won Woo Ro
- The 5th FTRA International Conference on Computer Science and its Applications
- (CSA-13)
- Danang, Vietnam, Dec. 18 - 21, 2013
- XSD: Accelerating MapReduce by Harnessing the GPU inside an SSD
- Benjamin Y. Cho, Won Seob Jeong, Doohwan Oh, and Won Woo Ro
- The 1st Workshop on Near-Data Processing, in conjunction with MICRO-46
- (WoNDP 2013)
- Davis, USA, Dec. 8, 2013
- Mark-Sharing: A Parallel Garbage Collection Algorithm for Low Synchronization Overhead
- Hyunkyu Park, Changmin Lee, Seung Hun Kim, Won Woo Ro, and Jean-Luc Gaudiot
- The 19th IEEE International Conference on Parallel and Distributed Systems
- (ICPADS 2013)
- Seoul, Korea, Dec. 15 - 18, 2013
- Leveraging Effectiveness of Contention Management for Transactional Memory Systems with Performance Monitoring
- Keunsoo Kim, Seung Hun Kim, Sang-Min Lee, and Won Woo Ro
- The 28th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2013)
- Yeosu, Korea, Jun. 30 - July 3, 2013
- MGMR: Multi-GPU Based MapReduce
- Yi Chen, Zhi Qiao, Hai Jiang, Kuan-Ching Li, and Won Woo Ro
- The 8th International Conference on Grid and Pervasive Computing
- (GPC 2013)
- Seoul, Korea, May 9 - 11, 2013
- Parallel GPU Architecture Simulation Framework Exploiting Work Allocation Unit Parallelism
- Sangpil Lee and Won Woo Ro
- The 2013 IEEE International Symposium on Performance Analysis of Systems and Software
- (ISPASS 2013)
- Austin, TX, USA, Apr. 21 - 23, 2013
- Directory Centralized Ring-based Interconnection for Multi-Core Systems
- Myung Kuk Yoon, Sangpil Lee, Deokho Kim, and Won Woo Ro
- The 12th International Conference on Electronics, Information and Communication
- (ICEIC 2013)
- Bali, Indonesia, Jan. 30 - Feb. 2, 2013
- Parallel Garbage Collection with Transactional Memory
- Hyunkyu Park, Changmin Lee, and Won Woo Ro
- The 12th International Conference on Electronics, Information and Communication
- (ICEIC 2013)
- Bali, Indonesia, Jan. 30 - Feb. 2, 2013
Journal Papers
- Multi-Threading and Suffix Grouping on Massive Multiple Pattern Matching Algorithm
- Doohwan Oh and Won Woo Ro
- The Computer Journal, Vol. 55, No. 11, pp. 1331-1346, Nov. 2012
- Offloading of Media Transcoding for High-Quality Multimedia Services
- Seung Hun Kim, Keunsoo Kim, Changmin Lee, and Won Woo Ro
- IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, pp. 691-699, May 2012
- Design of a Power-Efficient Parallel Pipelined Bloom Filter
- Deokho Kim, Doohwan Oh, and Won Woo Ro
- Electronics Letters, Vol. 48, No. 7, pp. 367-369, Mar. 2012
- Reconfigurable and Parallelized Network Coding Decoder for VANETs
- Sunwoo Kim and Won Woo Ro
- Mobile Information Systems, Vol. 8, No. 1, pp. 45-59, Feb. 2012
- Accelerated Network Coding with Dynamic Stream Decomposition on Graphics Processing Unit
- Sangpil Lee and Won Woo Ro
- The Computer Journal, Vol. 55, No. 1, pp. 21-34, Jan. 2012
Conference Papers
- On Migration and Consolidation of VMs in Hybrid CPU-GPU Environments
- Kuan-Ching Li, Keunsoo Kim, Won Woo Ro, Tien-Hsiung Weng, Che-Lun Hung, Chen-Hao Ku, Albert Cohen, and Jean-Luc Gaudiot
- International Conference on Intelligent Technologies and Engineering Systems
- (ICITES 2012) - LNEE
- Changhua, Taiwan, Dec. 13 - 15, 2012
- Conflict Avoidance Scheduling using Grouping List for Transactional Memory
- Dongmin Choi, Seung Hun Kim, and Won Woo Ro
- The 17th International Workshop on High-Level Parallel Programming Models and Supportive Environments
- (HIPS-17)
- Shanghai, China, May 21, 2012
- Cooperative Heterogeneous Computing for Parallel Processing on CPU/GPU Hybrids
- Changmin Lee, Won Woo Ro, and Jean-Luc Gaudiot
- The 16th Workshop on Interaction between Compilers and Computer Architectures
- New Orleans, USA, Feb. 25 - 29, 2012
- Matrix Data Layout Optimization for Multi-Core Architectures
- Minwoo Kim and Won Woo Ro
- The 11th International Conference on Electronics, Information and Communication
- (ICEIC 2012)
- Jeongseon, Korea, Feb. 1 - 3, 2012
- The Effect of Concurrency Control in Transactional Memory Systems
- Seung Hun Kim, Dongmin Choi, and Won Woo Ro
- The 11th International Conference on Electronics, Information and Communication
- (ICEIC 2012)
- Jeongseon, Korea, Feb. 1 - 3, 2012
- Adaptive Replacement Cache in Transactional Memory
- Dongmin Choi, Hyunkyu Park, Seung Hun Kim, and Won Woo Ro
- The 11th International Conference on Electronics, Information and Communication
- (ICEIC 2012)
- Jeongseon, Korea, Feb. 1 - 3, 2012
Journal Papers
- A Novel Sequential Tree Algorithm Based on Scoreboard for MPI Broadcast Communication
- Won-young Chung, Jae-won Park, Seung-Woo Lee, Won Woo Ro, and Yong-surk Lee
- IEICE Transactions on Information and Systems, Vol. 94, No. 12, pp. 2523-2527, Dec. 2011
- Network Coding on Heterogeneous Multi-Core Processors for Wireless Sensor Networks
- Deokho Kim, Karam Park, and Won W. Ro
- Sensors, Vol. 11, No. 8, pp. 7908-7933, Aug. 2011
- A Low-Cost Standard Mode MPI Hardware Unit for Embedded MPSoC
- Won-Young Chung, Ha-Young Jeong, Won W. Ro, and Yong-Surk Lee
- IEICE Transactions on Information and Systems, Vol. E94-D, No. 7, pp. 1497-1501, July 2011
Conference Papers
- Parallel Transpose of Matrix Multiplication Based on the Tiling Algorithm
- Minwoo Kim, Yong J. Jang, and Won W. Ro
- The 54th IEEE International Midwest Symposium on Circuits and Systems
- (MWSCAS 2011)
- Seoul, Korea, Aug. 7 - 10, 2011
- Performance Evaluation of Adaptive Progressive Network Coding
- Deokho Kim, Karam Park, and Won W. Ro
- The 54th IEEE International Midwest Symposium on Circuits and Systems
- (MWSCAS 2011)
- Seoul, Korea, Aug. 7 - 10, 2011
Journal Papers
- Multithreaded Pattern Matching Algorithm with Data Rearrangement
- Doohwan Oh, Seung Hun Kim, and Won W. Ro
- IEICE Electronics Express, Vol. 7, No. 20, pp. 1520-1526, Oct. 2010
- On Improving Parallelized Network Coding with Dynamic Partitioning
- Karam Park, Joon-Sang Park, and Won W. Ro
- IEEE Transactions on Parallel and Distributed Systems, Vol. 21, No. 11, pp. 1547-1560, Nov. 2010
- Hardware Implementation of a Tessellation Accelerator for the OpenVG Standard
- Seung Hun Kim, Yunho Oh, Karam Park, and Won W. Ro
- IEICE Electronics Express, Vol. 7, No. 6, pp. 440-446, Mar. 2010
Conference Papers
- Development of Virtual CUDA Systems of Parallel Processing on CPU and GPGPU
- Doohwan Oh, Sangpil Lee, Deokho Kim, Changmin Lee, and Won W. Ro
- Workshop on Micro Architectural Support for Virtualization, Data Center Computing, and Clouds, in conjunction with MICRO 2010
- (MASVDC Workshop 2010)
- Atlanta, USA, Dec. 5, 2010
- Implementing FFT using SPMD style of OpenMP
- Tien-Hsiung Weng, Sheng-Wei Huang, Won Woo Ro, and Kuan-Ching Li
- In Proc. of the 6th International Conference on Networked Computing and Advanced Information Management
- (NCM 2010)
- Seoul, Korea, Aug. 16 - 18, 2010
- Multi-Threaded Filtered BackProjection Algorithm on Multi-Core Processors
- Yun H. Oh and Won W. Ro
- The 10th International Conference on Electronics, Information, and Communication
- (ICEIC 2010)
- Cebu, Philippines, Jun. 30 - July 2, 2010
- Accelerated Reconstruction Using Parallel Computing for Spiral Spectroscopic Imaging
- Dong H. Kim, Yun H. Oh, Yun H. Nam, M. Gu, and Won W. Ro
- In Proc. of 2010 International Society for Magnetic Resonance in Medicine Annual Meeting
- (2010 ISMRM Annual Meeting)
- Stockholm, Sweden, May 1 - 7, 2010
- FPGA Implementation of Highly Parallelized Decoder Logic for Network Coding
- Sunwoo Kim and Won W. Ro
- In Proc. of Eighteenth ACM/SIGDA International Symposium on Field-Programmable Gate Arrays
- (FPGA 2010)
- Monterey, USA, Feb. 21 - 23, 2010
Journal Papers
- A Complexity-Effective Microprocessor Design with Decoupled Dispatch Queues and Prefetching
- Won W. Ro and Jean-Luc Gaudiot
- Parallel Computing, Vol. 35, No. 5, pp. 255-268, May 2009
Conference Papers
- Evaluation of Cache Coherence Protocols on Multi-Core Systems with Linear Workloads
- Yong J. Jang and Won W. Ro
- In Proc. of 2009 International Colloquium on Computing, Communication, Control, and Management
- (CCCM 2009)
- Sanya, China, Aug. 8 - 9, 2009
- Comparing Open Source Web Services: gSoap and AXIS
- Jongwook Woo and Won W. Ro
- In Proc. of the 24th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2009)
- Jeju Island, Korea, July 5 - 8, 2009
- Efficient Parallelized Network Coding for P2P File Sharing Applications
- Karam Park, Joon-Sang Park, and Won W. Ro
- In Proc. of the 4th International Conference on Grid and Pervasive Computing
- (GPC 2009)
- Geneva, Switzerland, May 4 - 8, 2009
- Fully Pipelined Hardware Implementation of 128-bit SEED Block Cipher Algorithm
- Jaeyoung Yi, Karam Park, Joonseok Park, and Won W. Ro
- In Proc. of the 5th International Workshop on Applied Reconfigurable Computing
- (ARC 2009)
- Karlsruhe, Germany, Mar. 16 - 18, 2009
Book Chapters
- Programmability and Scalability on Multi-Core Architectures
- Jaeyoung Yi, Yong J. Jang, Doohwan Oh, and Won W. Ro
- Chapter in "Handbook of Research on Scalable Computing Technologies", edited by Kuan-Ching Li, Ching-Hsien Hsu, Laurence Tianruo Yang, Jack Dongarra, and Hans Zima, Information Science Reference, 2009
Journal Papers
- Efficient Peer-to-Peer File Sharing Using Network Coding in MANET
- Uichin Lee, Joon-Sang Park, Seung-Hoon Lee, Won W. Ro, Giovanni Pau, and Mario Gerla
- Journal of Communications and Networks, Vol. 10, No. 4, Dec. 2008
- A Low-Complexity Microprocessor Design with Speculative Pre-Execution
- Won W. Ro and Jean-Luc Gaudiot
- Journal of Systems Architecture, Vol. 54, No. 12, pp. 1101-1112, Dec. 2008
- Performance Evaluation of Programming Models for SMP-Based Clusters
- Myungho Lee, Neungsoo Park, Won W. Ro, and Kuan-Ching Li
- Journal of the Chinese Institute of Engineers, Vol. 31, No. 7, pp. 1181-1188, Dec. 2008
- Simultaneous Thin-Thread Processors for Low-Power Embedded Systems
- Won W. Ro, Jaeyoung Yi, Joon-Sang Park, and Joonseok Park
- IEICE Electronics Express, Vol. 5, No. 19, pp. 802-808, Oct. 2008
- Delay Analysis of Car-to-Car Reliable Data Delivery Strategies Based on Data Mulling with Network Coding
- Joon-Sang Park, Uichin Lee, Soon Young Oh, Mario Gerla, Desmond Siumen Lun, Won W. Ro, and Joonseok Park
- IEICE Transactions on Information and Systems, Vol. E91-D, No. 10, Oct. 2008
Conference Papers
- Parallel Algorithms for Steiner Tree Problem
- Joon-Sang Park, Won W. Ro, Handuck Lee, and Neungsoo Park
- In Proc. of the 3rd International Conference on Convergence and Hybrid Information Technology
- (ICHIT 2008)
- Busan, Korea, Nov. 11 - 13, 2008
Journal Papers
- Design and Evaluation of a Hierarchical Decoupled Architecture
- Won W. Ro, Stephen P. Crago, Alvin M. Despain, and Jean-Luc Gaudiot
- Journal of Supercomputing, Springer, Vol. 38, No. 3, pp. 237-259, Dec. 2006
- Speculative Pre-Execution Assisted by Compiler (SPEAR)
- Won W. Ro and Jean-Luc Gaudiot
- Journal of Parallel and Distributed Computing, Elsevier, Vol. 66, No. 8, pp. 1076-1089, Aug. 2006
Conference Papers
- Design and Effectiveness of Small-Sized Decoupled Dispatch Queues
- Won W. Ro and Jean-Luc Gaudiot
- In Proc. of European Conference on Parallel Computing - LNCS
- (EURO-PAR 2006)
- Dresden, Germany, Aug. 29 - Sep. 1, 2006
Conference Papers
- A Low-Complexity Issue Queue Design with Speculative Pre-Execution
- Won W. Ro and Jean-Luc Gaudiot
- In Proc. of the 12th International Conference on High Performance Computing
- (HiPC 2005)
- Goa, India, Dec. 18 - 21, 2005
Book Chapters
- Techniques to Improve Performance Beyond Pipelining: Superpipelining, Superscalar, and VLIW
- Jean-Luc Gaudiot, Jung-Yup Kang, and Won Woo Ro
- Chapter in "Computer Architecture", a volume of "Advances in Computers", edited by Ali R. Hurson, Elsevier, 2005
Conference Papers
- SPEAR: A Hybrid Model for Speculative Pre-Execution
- Won W. Ro and Jean-Luc Gaudiot
- In Proc. of the 18th International Parallel and Distributed Processing Symposium
- (IPDPS 2004)
- Santa Fe, New Mexico, 2004
Conference Papers
- HiDISC: A Decoupled Architecture for Data-Intensive Applications
- Won W. Ro, Jean-Luc Gaudiot, Stephen P. Crago, and Alvin M. Despain
- In Proc. of the 17th International Parallel and Distributed Processing Symposium
- (IPDPS 2003)
- Nice, France, Apr. 22 - 26, 2003
- Compiler Support for Dynamic Speculative Pre-Execution
- Won W. Ro and Jean-Luc Gaudiot
- In Proc. of the 7th Annual Workshop on Interaction between Compilers and Computer Architectures
- (INTERACT-7) in conjunction with HPCA-9
- Anaheim, California, Feb. 8, 2003
Conference Papers
- Memory Latency: to Tolerate or to Reduce?
- Amol Bakshi, Jean-Luc Gaudiot, Wen-Yen Lin, Manil Makhija, Viktor K. Prasanna, Wonwoo Ro, and Chulho Shin
- In Proc. of the 12th Symposium on Computer Architecture and High Performance Computing
- (SBAC-PAD'00)
- Sao Pedro, Brazil, Oct. 24 - 27, 2000
- A High-Performance, Hierarchical Decoupled Architecture
- Stephen P. Crago, Alvin Despain, Jean-Luc Gaudiot, Manil Makhija, Wonwoo Ro, and Apoorv Srivastava
- In Proc. of the Memory access Decoupling for superscalar and multiple issue Architectures
- (MEDEA) Workshop in conjunction with PACT 2000
- Philadelphia, Oct. 15, 2000
- A Reliable Cluster Computing with a New Checkpointing RAID-x Architecture
- Kai Hwang, Hai Jin, Roy Ho, and Wonwoo Ro
- In Proc. of the 9th Heterogeneous Computing Workshop
- (HCW)
- Cancun, Mexico, May 1, 2000