Selected Publications
2025
In Press
Conference Papers
- Ditto: Accelerating Diffusion Model via Temporal Value Similarity
- Sungbin Kim*, Hyunwuk Lee*, Wonho Cho, Mincheol Park, and Won Woo Ro
[Top-Tier]  
The 31st IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2025   (IF: 4, NRF BK21four)
- Marching Page Walks: Batching and Concurrent Page Table Walks for Enhancing GPU Throughput
- Jiwon Lee, Gun Ko, Myung Kuk Yoon, Ipoom Jeong, Yunho Oh, and Won Woo Ro
[Top-Tier]  
The 31st IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2025   (IF: 4, NRF BK21four)
- Qubit Movement-Optimized Program Generation on Zoned Neutral Atom Processors
- Enhyeok Jang, Youngmin Kim, Hyungseok Kim, Seungwoo Choi, Yipeng Huang, and Won Woo Ro
[Top-Tier]  
The IEEE/ACM International Symposium on Code Generation and Optimization (CGO), 2025   (IF: 3, NRF BK21four)
2024
Journal Papers
- SHREG: Mitigating Register Redundancy in GPUs
[Link]
- Seunghyun Jin, Hyunwuk Lee, Jonghyun Lee, Junsung Kim, and Won Woo Ro
[SCI-Q1]  
Journal of Systems Architecture, Vol. 152, Jul. 2024
(IF: 4.5, Q1, JCR2022)
Conference Papers
- DEPrune: Depth-wise Separable Convolution Pruning for Maximizing GPU Parallelism
- Cheonjun Park, Mincheol Park, Hyunchan Moon, Myung Kuk Yoon, Seokjin Go, Suhyun Kim, and Won Woo Ro
[Top-Tier]  
The 38th Annual Conference on Neural Information Processing Systems (NeurIPS), 2024   (IF: 4, NRF BK21four, Acceptance Rate: 25.8%)
- AirGun: Adaptive Granularity Quantization for Accelerating Large Language Models
- Sungbin Kim, Hyunwuk Lee, Sungwoo Kim, Cheolhwan Kim, and Won Woo Ro
- The International Conference on Computer Design (ICCD), 2024   (IF: 1, NRF BK21four, Acceptance Rate: 28%)
- MOSQ: Accelerating Classical Simulation of UCCSD Ansatz Circuits using Merged Operation
- Seungwoo Choi, Enhyeok Jang, Youngmin Kim, and Won Woo Ro
- The International Conference on Computer Design (ICCD), 2024   (IF: 1, NRF BK21four, Acceptance Rate: 28%)
- Generalizing Ray Tracing Accelerators for Tree Traversals on GPUs
- Dongho Ha*, Lufei Liu*, Yuan Hsi Chou, Seokjin Go, Won Woo Ro, Hung-Wei Tseng, and Tor M. Aamodt
[Top-Tier]  
The International Symposium on Microarchitecture (MICRO), 2024   (IF: 4, NRF BK21four, Acceptance Rate: 22.7%)
- Barber: Balancing Thermal Relaxation Deviations of NISQ Programs by Exploiting Bit-Inverted Circuits
- Enhyeok Jang, Seungwoo Choi, Youngmin Kim, Jeewoo Seo, and Won Woo Ro
[Top-Tier]  
The 2024 ACM/IEEE International Conference on Computer-Aided Design (ICCAD), 2024   (IF: 3, NRF BK21four, Acceptance Rate: 24%)
- Recompiling QAOA Circuits on Various Rotational Directions
[Link]
- Enhyeok Jang, Dongho Ha, Seungwoo Choi, Youngmin Kim, Jaewon Kwon, Yongju Lee, Sungwoo Ahn, Hyungseok Kim, and Won Woo Ro
[Top-Tier]  
The 33rd International Conference on Parallel Architectures and Compilation Techniques (PACT), 2024   (IF: 3, NRF BK21four)
- M3XU: Achieving High-Precision and Complex Matrix Multiplication with Low-Precision MXUs
- Dongho Ha, Yunan Zhang, Chen-Chien Kao, Christopher J. Hughes, Won Woo Ro, and Hung-Wei Tseng
[Top-Tier]  
The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), 2024   (IF: 3, NRF BK21four, Acceptance Rate: 22.7%)
- GUMSO: Gating Unnecessary On-Chip Memory Slices for Power Optimization on GPUs
[Link]
- Seunghyun Jin, Hyunwuk Lee, and Won Woo Ro
- The 2024 ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED), 2024   (IF: 1, NRF BK21four)
- Geneva: A Dynamic Confluence of Speculative Execution and In-Order Commitment Windows
[Link]
- Yanghee Lee, Jiwon Lee, Jaewon Kwon, Yongju Lee, and Won Woo Ro
[Top-Tier]  
The 61st Design Automation Conference (DAC), 2024   (IF: 3, NRF BK21four, Acceptance Rate: 23%)
- REPrune: Channel Pruning via Kernel Representative Selection
[Link]
- Mincheol Park, Dongjin Kim, Cheonjun Park, Yuna Park, Gyeong Eun Gong, Won Woo Ro, and Suhyun Kim
[Top-Tier]  
The 38th AAAI Conference on Artificial Intelligence (AAAI), 2024   (IF: 4, NRF BK21four, Acceptance Rate: 23.7% [2342/12100])
2023
Journal Papers
- A Convertible Neural Processor Supporting Adaptive Quantization for Real-Time Neural Networks
[Link]
- Hongju Kal, Hyoseong Choi, Ipoom Jeong, Joon-Sung Yang, and Won Woo Ro
[SCI-Q1]  
Journal of Systems Architecture, Vol. 145, Nov. 2023
(IF: 4.5, Q1, JCR2022)
Conference Papers
- INTERPRET: Inter-Warp Register Reuse for GPU Tensor Cores
[Link]
- Jae Seok Kwak, Myung Kuk Yoon, Ipoom Jeong, Seunghyun Jin, and Won Woo Ro
[Top-Tier]  
The 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT), 2023   (IF: 3, NRF BK21four)
- McCore: A Holistic Management of High-Performance Heterogeneous Multicores
[Link]
- Jaewon Kwon, Yongju Lee, Hongju Kal, Minjae Kim, Youngsok Kim, and Won Woo Ro
[Top-Tier]  
The 56th International Symposium on Microarchitecture (MICRO), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 23.8% [101/424])
- AESPA: Asynchronous Execution Scheme to Exploit Bank-Level Parallelism of Processing-in-Memory
[Link]
- Hongju Kal, Chanyoung Yoo, and Won Woo Ro
[Top-Tier]  
The 56th International Symposium on Microarchitecture (MICRO), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 23.8% [101/424])
- MAD MAcce: Supporting Multiply-Add Operations for Democratizing Matrix-Multiplication Accelerator
[Link]
- Seunghwan Sung, Sujin Hur, Dongho Ha, Sungwoo Kim, Yunho Oh, and Won Woo Ro
[Top-Tier]  
The 56th International Symposium on Microarchitecture (MICRO), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 23.8% [101/424])
- Exploiting Inherent Properties of Complex Numbers for Accelerating Complex Valued Neural Networks
[Link]
- Hyunwuk Lee, Hyungjun Jang, Sungbin Kim, Sungwoo Kim, Wonho Cho, and Won Woo Ro
[Top-Tier]  
The 56th International Symposium on Microarchitecture (MICRO), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 23.8% [101/424])
- TensorCV: Accelerating Inference-Adjacent Computation Using Tensor Processors
[Link]
- Dongho Ha, Won Woo Ro, and Hung-Wei Tseng
- The 2023 ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED), 2023   (IF: 1, NRF BK21four)
- R2D2: Removing ReDunDancy Utilizing Linearity of Address Generation in GPUs
[Link]
- Dongho Ha, Yunho Oh, and Won Woo Ro
[Top-Tier]  
The 50th International Symposium on Computer Architecture (ISCA), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 21.2% [79/372])
- Early-Adaptor: An Adaptive Framework for Proactive UVM Memory Management
[Link]
- Seokjin Go, Hyunwuk Lee, Junsung Kim, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
- The 2023 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2023   (IF: 1, NRF BK21four, Acceptance Rate: 37.8%)
- Lightning Talk: Efficiency and Programmability of DNN Accelerators and GPUs
[Link]
- Won Woo Ro
[Top-Tier]  
The 60th Design Automation Conference (DAC), 2023   (IF: 3, NRF BK21four, Acceptance Rate: 23%)
- Quixote: Improving Fidelity of Quantum Program by Independent Execution of Controlled Gates
[Link]
- Enhyeok Jang, Seungwoo Choi, and Won Woo Ro
[Top-Tier]  
The 60th Design Automation Conference (DAC), 2023   (IF: 3, NRF BK21four, Acceptance Rate: 23%)
- Balanced Column-Wise Block Pruning for Maximizing GPU Parallelism
[Link]
- Cheonjun Park, Mincheol Park, Hyun Jae Oh, Minkyu Kim, Myung Kuk Yoon, Suhyun Kim, and Won Woo Ro
[Top-Tier]  
The 37th AAAI Conference on Artificial Intelligence (AAAI), 2023   (IF: 4, NRF BK21four, Oral Acceptance Rate: 10.8% [952/8777], Oral Presentation)
- SnakeByte: A TLB Design with Adaptive and Recursive Page Merging in GPUs
[Link]
- Jiwon Lee, Ju Min Lee, Yunho Oh, William J. Song, and Won Woo Ro
[Top-Tier]  
The 29th IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2023   (IF: 4, NRF BK21four, Acceptance Rate: 25.0% [91/364])
2022
Journal Papers
- TEA-RC: Thread Context-Aware Register Cache for GPUs
[Link]
- Ipoom Jeong, Yunho Oh, Won Woo Ro, and Myung Kuk Yoon
[SCI-Q2]  
IEEE Access  
(IF: 3.476, Q2, JCR2021)
- CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs
[Link]
- Yunho Oh, Ipoom Jeong, Won Woo Ro, and Myung Kuk Yoon
[SCI-Q2]  
IEEE Embedded Systems Letters  
(IF: 2.169, Q2, JCR2020)
- FLIXR: Embedding Index into Flash Translation Layer in SSDs
[Link]
- Gunjae Koo, Yunho Oh, Hung-Wei Tseng, Won Woo Ro, and Murali Annavaram
[SCI-Q2]  
IEEE Transactions on Computers, doi: 10.1109/TC.2022.3154602, Feb. 2022
(IF: 2.663, Q2, JCR2020)
Conference Papers
- Reconstructing Out-of-Order Issue Queue
[Link]
- Ipoom Jeong, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
[Top-Tier]  
The 55th IEEE/ACM International Symposium on Microarchitecture (MICRO), 2022   (IF: 4, NRF BK21four, Acceptance Rate: 23.8% [83/348])
2021
Journal Papers
- Two-Stage In-Storage Processing and Scheduling for Pattern Matching Applications
[Link]
- Joohyeong Yoon, Yoonjin Lee, Won Seob Jeong, and Won Woo Ro
[SCI-Q1]  
IEEE Access, Vol. 9, pp. 95702-95715, Jun. 2021  
(IF: 3.367, Q1, JCR2020)
- PIMCaffe: Functional Evaluation of a Machine Learning Framework for In-Memory Neural Processing Unit
[Link]
- Won Jeon, Jiwon Lee, Dongseok Kang, Hongju Kal, and Won Woo Ro
[SCI-Q1]  
IEEE Access, Vol. 9, pp. 96629-96640, Jul. 2021  
(IF: 3.367, Q1, JCR2020)
Conference Papers
- SPACE: Locality-Aware Processing in Heterogeneous Memory for Personalized Recommendations
[Link]
- Hongju Kal, Seokmin Lee, Gun Ko, and Won Woo Ro
[Top-Tier]  
The 48th ACM/IEEE International Symposium on Computer Architecture (ISCA), 2021   (IF: 4, NRF BK21four, Acceptance Rate: 18.7% [76/406])
2020
Journal Papers
- Hi-End: Hierarchical, Endurance-Aware STT-MRAM-Based Register File for Energy-Efficient GPUs
[Link]
- Won Jeon, Jun Hyun Park, Yoonsoo Kim, Gunjae Koo, and Won Woo Ro
[SCI-Q1]  
IEEE Access, Vol. 8, pp. 127768-127780, Jul. 2020  
(IF: 3.745, Q1, JCR2019)
- REACT: Scalable and High-Performance Regular Expression Pattern Matching Accelerator for In-Storage Processing
[Link]
- Won Seob Jeong, Changmin Lee, Keunsoo Kim, Myung Kuk Yoon, Won Jeon, Myoungsoo Jung, and Won Woo Ro
[SCI-Q1]  
IEEE Transactions on Parallel and Distributed Systems, Vol. 31, Issue 5, pp. 1137-1151, May 2020
(IF: 3.402, Q1, JCR2018)
Conference Papers
- Duplo: Lifting Redundant Memory Accesses of Neural Networks for GPU Tensor Cores
[Link]
- Hyeonjin Kim, Sungwoo Ahn, Yunho Oh, Bogil Kim, Won Woo Ro, and William J. Song
[Top-Tier]  
The 53rd IEEE/ACM International Symposium on Microarchitecture (MICRO), 2020   (IF: 4, NRF BK21four, Acceptance Rate: 19.4% [82/422])
- Check-In: In-Storage Checkpointing for Key-Value Store System Leveraging Flash-Based SSDs
[Link]
- Joohyeong Yoon, Won Seob Jeong, and Won Woo Ro
[Top-Tier]  
The 47th ACM/IEEE International Symposium on Computer Architecture (ISCA), 2020   (IF: 4, NRF BK21+, Acceptance Rate: 18.2% [77/421])
- CASINO Core Microarchitecture: Generating Out-of-Order Schedules Using Cascaded In-Order Scheduling Windows
[Link]
- Ipoom Jeong, Seihoon Park, Changmin Lee, and Won Woo Ro
[Top-Tier]  
The 26th IEEE International Symposium on High Performance Computer Architecture (HPCA), 2020   (IF: 4, NRF BK21+, Acceptance Rate: 16.9% [48/284])
2019
Journal Papers
- OverCome: Coarse-Grained Instruction Commit with Handover Register Renaming
[Link]
- Ipoom Jeong, Changmin Lee, Keunsoo Kim, and Won Woo Ro
[SCI-Q1]  
IEEE Transactions on Computers, Vol. 68, Issue 12, pp. 1802-1816, Dec. 2019
(IF: 3.131, Q1, JCR2018)
- Contents-Aware Partitioning Algorithm for Parallel High Efficiency Video Coding
[Link]
- Kyungah Kim and Won Woo Ro
- Multimedia Tools and Applications, Vol. 78, Issue 9, pp. 11427-11442, May 2019
(IF: 2.101, Q3, JCR2018)
- Fast CU Depth Decision for HEVC using Neural Networks
[Link]
- Kyungah Kim and Won Woo Ro
[SCI-Q1]  
IEEE Transactions on Circuits and Systems for Video Technology, Vol. 29, No. 5, pp. 1462-1473, May 2019
(IF: 4.046, Q1, JCR2018)
- Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs
[Link]
- Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Murali Annavaram, and Won Woo Ro
[SCI-Q1]  
IEEE Transactions on Computers, Vol. 68, No. 4, pp. 609-616, Apr. 2019
(IF: 3.131, Q1, JCR2018)
Conference Papers
- Efficient Dilated-Winograd Convolutional Neural Networks
[Link]
- Minsik Kim, Cheonjun Park, Sungjun Kim, Taeyoung Hong, and Won Woo Ro
- The 2019 IEEE International Conference on Image Processing (ICIP), 2019 Taipei, Taiwan, Sep. 22 - 25 (Acceptance Rate: 46.2% [956/2068])
- Linebacker: Preserving Victim Cache Lines in Idle Register Files of GPUs
[Link]
- Yunho Oh, Gunjae Koo, Murali Annavaram, and Won Woo Ro
[Top-Tier]  
The 46th ACM/IEEE International Symposium on Computer Architecture (ISCA), 2019 Phoenix, Arizona, USA, Jun. 22 - 26 (IF: 4, NRF BK21four, Acceptance Rate: 17.0% [62/365])
2018
Journal Papers
- WASP: Selective Data Prefetching with Monitoring Runtime Warp Progress on GPUs
[Link]
- Yunho Oh, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, and Won Woo Ro
[SCI-Q1]  
IEEE Transactions on Computers, Vol. 67, No. 9, pp. 1366-1373, Sep. 2018
(IF: 3.052, Q1, JCR2017)
- Exploiting Pseudo-Quadtree Structure for Accelerating HEVC Spatial Resolution Downscaling Transcoder
[Link]
- Minsik Kim, Minyong Sung, Minwoo Kim, and Won Woo Ro
[SCI-Q1]  
IEEE Transactions on Multimedia, Vol. 20, No. 9, pp. 2262-2275, Sep. 2018
(IF: 3.977, Q1, JCR2017)
- Architectural Protection of Application Privacy against Software and Physical Attacks in Untrusted Cloud Environment
[Link]
- Lei Xu, JongHyuk Lee, Seung Hun Kim, Qingji Zheng, Shouhuai Xu, Taeweon Suh, Won Woo Ro, and Weidong Shi
[SCI-Q1]  
IEEE Transactions on Cloud Computing, Vol. 6, No. 2, pp. 478-491, Apr-Jun. 2018
(IF: 7.928, Q1, JCR2017)
- Simultaneous and Speculative Thread Migration for Improving Energy Efficiency of Heterogeneous Core Architectures
[Link]
- Changmin Lee and Won Woo Ro
[SCI-Q1]  
IEEE Transactions on Computers, Vol. 67, No. 4, pp. 498-512, Apr. 2018
(IF: 3.052, Q1, JCR2017)
Conference Papers
- FineReg: Fine-Grained Register File Management for Augmenting GPU Throughput
[Link]
- Yunho Oh, Myung Kuk Yoon, William J. Song, and Won Woo Ro
- The 51st IEEE/ACM International Symposium on Microarchitecture
[Top-Tier]  
(MICRO 2018) Fukuoka, Japan, Oct. 20 - 24, 2018
(IF: 4, NRF BK21+, Acceptance Rate: 21.1% [74/351])
- WIR: Warp Instruction Reuse to Minimize Repeated Computations in GPUs
[Link]
- Keunsoo Kim and Won Woo Ro
- The 24th IEEE International Symposium on High Performance Computer Architecture
[Top-Tier]  
(HPCA 2018) Vienna, Austria, Feb. 24 - 28, 2018
(IF: 4, NRF BK21+, Acceptance Rate: 20.8% [54/260])
2017
Journal Papers
- Dynamic Resizing on Active Warps Scheduler to Hide Operation Stalls on GPUs
[Link]
- Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
[SCI-Q1]  
IEEE Transactions on Parallel and Distributed Systems, Vol. 28, No. 11, pp. 3142-3156, Nov. 2017 (IF: 4.181, Q1, JCR2016)
- Dynamic Load Balancing of Dispatch Scheduling for Solid State Disks
[Link]
- Myunghyun Jo and Won Woo Ro
[SCI-Q1]  
IEEE Transactions on Computers, Vol. 66, No. 6, pp. 1034-1047, Jun. 2017 (IF: 2.916, Q1, JCR2016)
- Improving Energy Efficiency of GPUs through Data Compression and Compressed Execution
[Link]
- Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Murali Annavaram, and Won Woo Ro
[SCI-Q1]  
IEEE Transactions on Computers, Vol. 66, No. 5, pp. 834-847, May 2017 (IF: 2.916, Q1, JCR2016)
Conference Papers
- Access Pattern-Aware Cache Management for Improving Data Utilization in GPU
[Link]
- Gunjae Koo, Yunho Oh, Won Woo Ro, and Murali Annavaram
- The 44th ACM/IEEE International Symposium on Computer Architecture
[Top-Tier]  
(ISCA 2017) Toronto, Canada, Jun. 24 - 28, 2017 (IF: 4, NRF BK21+, Acceptance Rate: 16.8% [54/322])
2016
Journal Papers
- Server Side, Play Buffer Based Quality Control for Adaptive Media Streaming
[Link]
- Keunsoo Kim, Benjamin Y. Cho, and Won Woo Ro
- Multimedia Tools and Applications, Vol. 75, No. 10, pp. 5397-5415, May 2016 (IF: 1.331, Q2, JCR2015)
- Exploiting Thread-Level Parallelism on HEVC by Employing Reference Dependency Graph
[Link]
- Minwoo Kim, Deokho Kim, Kyungah Kim, and Won Woo Ro
[SCI-Q1]  
IEEE Transactions on Circuits and Systems for Video Technology, Vol. 26, No. 4, pp. 736-749, Apr. 2016 (IF: 2.254, Q1, JCR2015)
- Parallel GPU Architecture Simulation Framework Exploiting Architectural-Level Parallelism with Timing Error Prediction
[Link]
- Sangpil Lee and Won Woo Ro
[SCI-Q1]  
IEEE Transactions on Computers, Vol. 65, No. 4, pp. 1253-1265, Apr. 2016 (IF: 1.723, Q1, JCR2015)
Conference Papers
- Virtual Thread: Maximizing Thread-Level Parallelism beyond GPU Scheduling Limit
[Link]
- Myung Kuk Yoon, Keunsoo Kim, Sangpil Lee, Won Woo Ro, and Murali Annavaram
- The 43rd ACM/IEEE International Symposium on Computer Architecture
[Top-Tier]  
(ISCA 2016) Seoul, Korea, Jun. 18 - 22, 2016 (IF: 4, NRF BK21+)
- APRES: Improving Cache Efficiency by Exploiting Load Characteristics on GPUs
[Link]
- Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Won Woo Ro, and Murali Annavaram
- The 43rd ACM/IEEE International Symposium on Computer Architecture
[Top-Tier]  
(ISCA 2016) Seoul, Korea, Jun. 18 - 22, 2016 (IF: 4, NRF BK21+)
- Warped-Slicer: Efficient Intra-SM Slicing through Dynamic Resource Partitioning for GPU Multiprogramming
[Link]
- Qiumin Xu, Hyeran Jeon, Keunsoo Kim, Won Woo Ro, and Murali Annavaram
- The 43rd ACM/IEEE International Symposium on Computer Architecture
[Top-Tier]  
(ISCA 2016) Seoul, Korea, Jun. 18 - 22, 2016 (IF: 4, NRF BK21+)
- Warped-Preexecution: A GPU Pre-execution Approach for Improving Latency Hiding
[Link]
- Keunsoo Kim, Sangpil Lee, Myung Kuk Yoon, Gunjae Koo, Won Woo Ro, and Murali Annavaram
- The 22nd IEEE International Symposium on High Performance Computer Architecture
[Top-Tier]  
(HPCA 2016) Barcelona, Spain, Mar. 12 - 16, 2016 (IF: 4, NRF BK21+)
2015
Journal Papers
- A Performance-Energy Model to Evaluate Single Thread Execution Acceleration
[Link]
- Seung Hun Kim, Dohoon Kim, Changmin Lee, Won Seob Jeong, Won Woo Ro, and Jean-Luc Gaudiot
- IEEE Computer Architecture Letters, Vol. 14, No. 2, pp. 99-102, Dec. 2015 (IF: 0.677, Q3, JCR2014)
- Dynamic Load Balancing of Parallel SURF with Vertical Partitioning
[Link]
- Deokho Kim, Minwoo Kim, Kyungah Kim, Minyong Sung, and Won Woo Ro
[SCI-Q1]  
IEEE Transactions on Parallel and Distributed Systems, Vol. 26, No. 12, pp. 3358-3370, Dec. 2015 (IF: 2.170, Q1, JCR2014)
- Network Variation and Fault Tolerant Performance Acceleration in Mobile Devices with Simultaneous Remote Execution
[Link]
- Keunsoo Kim, Benjamin Y. Cho, Won Woo Ro, and Jean-Luc Gaudiot
[SCI-Q1]  
IEEE Transactions on Computers, Vol. 64, No. 10, pp. 2862-2874, Oct. 2015 (IF: 1.659, Q1, JCR2014)
- Highly Secure Mobile Devices Assisted with Trusted Cloud Computing Environments
[Link]
- Doohwan Oh, Ilkyu Kim, Keunsoo Kim, Sang-Min Lee, and Won Woo Ro
- ETRI Journal, Vol. 37, No. 2, pp. 348-358, Apr. 2015 (IF: 0.771, Q3, JCR2014)
Conference Papers
- True Motion Compensation With Feature Detection for Frame Rate Up-Conversion
[Link]
- Kyungah Kim, Minwoo Kim, Deokho Kim, and Won Woo Ro
- The 2015 IEEE International Conference on Image Processing
- (ICIP 2015) Quebec City, Canada, Sep. 27 - 30, 2015
- An Accelerated Separable Median Filter with Sorting Networks
[Link]
- Minsik Kim, Deokho Kim, Minyong Sung, and Won Woo Ro
- The 2015 IEEE International Conference on Image Processing
- (ICIP 2015) Quebec City, Canada, Sep. 27 - 30, 2015
- Enhancing Software Dependability and Security with Hardware Supported Instruction Address Space Randomization
[Link]
- Seung Hun Kim, Lei Xu, Ziyi Liu, Zhiqiang Lin, Won Woo Ro, and Weidong Shi
- The 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks
- (DSN 2015) Rio de Janeiro, Brazil, Jun. 22 - 25, 2015
- Warped-Compression: Enabling Power Efficient GPUs through Register Compression
[Link]
- Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Won Woo Ro, and Murali Annavaram
- The 42nd ACM/IEEE International Symposium on Computer Architecture
[Top-Tier]  
(ISCA 2015) Portland, OR, USA, Jun. 13 - 17, 2015 (IF: 4, NRF BK21+)
- DRAW: Investigating Benefits of Adaptive Fetch Group Size on GPU
[Link]
- Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
- The 2015 IEEE International Symposium on Performance Analysis of Systems and Software
- (ISPASS 2015) Philadelphia, PA, USA, Mar. 29 - 31, 2015
2014
Journal Papers
- A Malicious Pattern Detection Engine for Embedded Security Systems in Internet of Things
[Link]
- Doohwan Oh, Deokho Kim, and Won Woo Ro
- Sensors, Vol. 14, No. 12, pp. 24188-24211, Dec. 2014 (IF: 2.048, Q2, JCR2013)
- C-Lock: Energy Efficient Synchronization for Embedded Multicore Systems
[Link]
- Seung Hun Kim, Sang Hyong Lee, Minje Jun, Byunghoon Lee, Won Woo Ro, Eui-Young Chung, and Jean-Luc Gaudiot
- IEEE Transactions on Computers, Vol. 63, No. 8, pp. 1962-1974, Aug. 2014 (IF: 1.473, Q2, JCR2013)
- Swarm Processor System: Hardware Process Scheduler based Energy Efficient Multi-Core System
[Link]
- Won Seob Jeong, Seung Hun Kim, Sang-Min Lee, and Won Woo Ro
- IEICE Electronics Express, Vol. 11, No. 14, pp. 20140424, July 2014 (IF: 0.391, Q4, JCR2013)
- Complexity-Effective Contention Management with Dynamic Backoff for Transactional Memory Systems
[Link]
- Seung Hun Kim, Dongmin Choi, Won Woo Ro, and Jean-Luc Gaudiot
- IEEE Transactions on Computers, Vol. 63, No. 7, pp. 1696-1708, July 2014 (IF: 1.473, Q2, JCR2013)
- Architectural Investigation of Matrix Data Layout on Multicore Processors
[Link]
- Minwoo Kim and Won Woo Ro
[SCI-Q1]  
Future Generation Computer Systems, Vol. 37, pp. 64-75, July 2014 (IF: 2.639, Q1, JCR2013)
- Exploiting Implementation Diversity and Partial Connection of Routers in Application-Specific Network-on-Chip Topology Synthesis
[Link]
- Minje Jun, Won Woo Ro, and Eui-Young Chung
- IEEE Transactions on Computers, Vol. 63, No. 6, pp. 1434-1445, Jun. 2014 (IF: 1.473, Q2, JCR2013)
- Accelerating MapReduce Framework on Multi-GPU Systems
[Link]
- Hai Jiang, Yi Chen, Zhi Qiao, Kuan-Ching Li, Won Woo Ro, and Jean-Luc Gaudiot
- Cluster Computing, Vol. 17, No. 2, pp. 293-301, Jun. 2014 (IF: 0.949, Q3, JCR2013)
- Boosting CUDA Applications with CPU-GPU Hybrid Computing
[Link]
- Changmin Lee, Won Woo Ro, and Jean-Luc Gaudiot
- International Journal of Parallel Programming, Vol. 42, No. 2, pp. 384-404, Apr. 2014 (IF: 0.500, Q4, JCR2013)
- This is an extension of our INTERACT-16 paper, which was selected as one of the best papers and recommended to IJPP.
Conference Papers
- LUT based Secure Cloud Computing - an Implementation using FPGAs
[Link]
- Lei Xu, Pham Dang Khoa, Seung Hun Kim, Won Woo Ro, and Weidong Shi
- 2014 International Conference on ReConFigurable Computing and FPGAs
- (ReConFig 2014) Cancun, Mexico, Dec. 7 - 10, 2014
- Workload Synthesis: Generating Benchmark Workloads from Statistical Execution Profile
[Link]
- Keunsoo Kim, Changmin Lee, Jung Ho Jung, and Won Woo Ro
- IEEE International Symposium on Workload Characterization
- (IISWC 2014) Raleigh, North Carolina, USA, Oct. 26 - 28, 2014
2013
Journal Papers
- Parallelized Sub-Resource Loading for Web Rendering Engine
[Link]
- Deokho Kim, Changmin Lee, Sangpil Lee, and Won Woo Ro
- Journal of Systems Architecture, Vol. 59, No. 9, pp. 785-793, Oct. 2013 (IF: 0.724, Q3, JCR2012)
- Design and Evaluation of Random Linear Network Coding Accelerators on FPGAs
[Link]
- Sunwoo Kim, Won Seob Jeong, Won Woo Ro, and Jean-Luc Gaudiot
- ACM Transactions on Embedded Computing Systems, Vol. 13, No. 1, pp. 1-24, Aug. 2013 (IF: 1.178, Q2, JCR2012)
- GPU-Friendly Parallel Genome Matching with Tiled Access and Reduced State Transition Table
[Link]
- Yunho Oh, Doohwan Oh, and Won Woo Ro
- International Journal of Parallel Programming, Vol. 41, No. 4, pp. 526-551, Aug. 2013 (IF: 0.404, Q4, JCR2012)
- A Distributed Signature Detection Method for Detecting Intrusions in Sensor Systems
[Link]
- Ilkyu Kim, Doohwan Oh, Myung Kuk Yoon, Kyueun Yi, and Won Woo Ro
- Sensors, Vol. 13, No. 4, pp. 3998-4016, Mar. 2013 (IF: 1.953, Q3, JCR2012)
- Exploiting SIMD Parallelism on Dynamically Partitioned Parallel Network Coding for P2P Systems
[Link]
- Deokho Kim, Karam Park, and Won Woo Ro
- Computers & Electrical Engineering, Vol. 39, No. 1, pp. 55-56, Jan. 2013 (IF: 0.928, Q3, JCR2012)
- Benefits of Using Parallelized Non-Progressive Network Coding
[Link]
- Minwoo Kim, Karam Park, and Won Woo Ro
[SCI-Q1]  
Journal of Network and Computer Applications, Vol. 36, No. 1, pp. 293-305, Jan. 2013 (IF: 1.467, Q1, JCR2012)
- Importance of Coherence Protocols with Network Applications on Multi-Core Processors
[Link]
- Kyueun Yi, Won Woo Ro, and Jean-Luc Gaudiot
- IEEE Transactions on Computers, Vol. 62, No. 1, pp. 6-15, Jan. 2013 (IF: 1.379, Q2, JCR2012)
Conference Papers
- XSD: Accelerating MapReduce by Harnessing the GPU inside an SSD
[Link]
- Benjamin Y. Cho, Won Seob Jeong, Doohwan Oh, and Won Woo Ro
- The 1st Workshop on Near-Data Processing, in conjunction with MICRO-46
- (WoNDP 2013) Davis, USA, Dec. 8, 2013
- Mark-Sharing: A Parallel Garbage Collection Algorithm for Low Synchronization Overhead
[Link]
- Hyunkyu Park, Changmin Lee, Seung Hun Kim, Won Woo Ro, and Jean-Luc Gaudiot
- The 19th IEEE International Conference on Parallel and Distributed Systems
- (ICPADS 2013) Seoul, Korea, Dec. 15 - 18, 2013
- MGMR: Multi-GPU Based MapReduce
[Link]
- Yi Chen, Zhi Qiao, Hai Jiang, Kuan-Ching Li, and Won Woo Ro
- The 8th International Conference on Grid and Pervasive Computing
- (GPC 2013) Seoul, Korea, May 9 - 11, 2013
- Parallel GPU Architecture Simulation Framework Exploiting Work Allocation Unit Parallelism
[Link]
- Sangpil Lee and Won Woo Ro
- The 2013 IEEE International Symposium on Performance Analysis of Systems and Software
- (ISPASS 2013) Austin, TX, USA, Apr. 21 - 23, 2013
2012
Journal Papers
- Multi-Threading and Suffix Grouping on Massive Multiple Pattern Matching Algorithm
[Link]
- Doohwan Oh and Won Woo Ro
- The Computer Journal, Vol. 55, No. 11, pp. 1331-1346, Nov. 2012 (IF: 0.785, Q3, JCR2011)
- Offloading of Media Transcoding for High-Quality Multimedia Services
[Link]
- Seung Hun Kim, Keunsoo Kim, Changmin Lee, and Won Woo Ro
- IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, pp. 691-699, May 2012 (IF: 0.941, Q3, JCR2011)
- Design of a Power-Efficient Parallel Pipelined Bloom Filter
[Link]
- Deokho Kim, Doohwan Oh, and Won Woo Ro
- Electronics Letters, Vol. 48, No. 7, pp. 367-369, Mar. 2012 (IF: 0.965, Q3, JCR2011)
- Reconfigurable and Parallelized Network Coding Decoder for VANETs
[Link]
- Sunwoo Kim and Won Woo Ro
[SCI-Q1]  
Mobile Information Systems, Vol. 8, No. 1, pp. 45-59, Feb. 2012 (IF: 2.432, Q1, JCR2011)
- Accelerated Network Coding with Dynamic Stream Decomposition on Graphics Processing Unit
[Link]
- Sangpil Lee and Won Woo Ro
- The Computer Journal, Vol. 55, No. 1, pp. 21-34, Jan. 2012 (IF: 0.785, Q3, JCR2011)
Conference Papers
- Conflict Avoidance Scheduling using Grouping List for Transactional Memory
[Link]
- Dongmin Choi, Seung Hun Kim, and Won Woo Ro
- The 17th International Workshop on High-Level Parallel Programming Models and Supportive Environments
- (HIPS-17) Shanghai, China, May 21, 2012
- Cooperative Heterogeneous Computing for Parallel Processing on CPU/GPU Hybrids
[Link]
- Changmin Lee, Won Woo Ro, and Jean-Luc Gaudiot
- The 16th Workshop on Interaction between Compilers and Computer Architectures
- (INTERACT-16) New Orleans, USA, Feb. 25 - 29, 2012
2011
Journal Papers
- A Novel Sequential Tree Algorithm Based on Scoreboard for MPI Broadcast Communication
[Link]
- Won-young Chung, Jae-won Park, Seung-Woo Lee, Won Woo Ro, and Yong-surk Lee
- IEICE Transactions on Information and Systems, Vol. 94, No. 12, pp. 2523-2527, Dec. 2011 (IF: 0.268, Q4, JCR2010)
- Network Coding on Heterogeneous Multi-Core Processors for Wireless Sensor Networks
[Link]
- Deokho Kim, Karam Park, and Won W. Ro
- Sensors, Vol. 11, No. 8, pp. 7908-7933, Aug. 2011 (IF: 1.774, Q3, JCR2010)
- A Low-Cost Standard Mode MPI Hardware Unit for Embedded MPSoC
[Link]
- Won-Young Chung, Ha-Young Jeong, Won W. Ro, and Yong-Surk Lee
- IEICE Transactions on Information and Systems, Vol. E94-D, No. 7, pp. 1497-1501, July 2011 (IF: 0.268, Q4, JCR2010)
Conference Papers
- Parallel Transpose of Matrix Multiplication Based on the Tiling Algorithm
[Link]
- Minwoo Kim, Yong J. Jang, and Won W. Ro
- The 54th IEEE International Midwest Symposium on Circuits and Systems
- (MWSCAS 2011) Seoul, Korea, Aug. 7 - 10, 2011
- Performance Evaluation of Adaptive Progressive Network Coding
[Link]
- Deokho Kim, Karam Park, and Won W. Ro
- The 54th IEEE International Midwest Symposium on Circuits and Systems
- (MWSCAS 2011) Seoul, Korea, Aug. 7 - 10, 2011
2010
Journal Papers
- Multithreaded Pattern Matching Algorithm with Data Rearrangement
[Link]
- Doohwan Oh, Seung Hun Kim, and Won W. Ro
- IEICE Electronics Express, Vol. 7, No. 20, pp. 1520-1526, Oct. 2010 (IF: 0.510, Q3, JCR2009)
- On Improving Parallelized Network Coding with Dynamic Partitioning
[Link]
- Karam Park, Joon-Sang Park, and Won W. Ro
[SCI-Q1]  
IEEE Transactions on Parallel and Distributed Systems, Vol. 21, No. 11, pp. 1547-1560, Nov. 2010 (IF: 1.733, Q1, JCR2009)
- Hardware Implementation of a Tessellation Accelerator for the OpenVG Standard
[Link]
- Seung Hun Kim, Yunho Oh, Karam Park, and Won W. Ro
- IEICE Electronics Express, Vol. 7, No. 6, pp. 440-446, Mar. 2010 (IF: 0.510, Q3, JCR2009)
Conference Papers
- Development of Virtual CUDA Systems of Parallel Processing on CPU and GPGPU
[Link]
- Doohwan Oh, Sangpil Lee, Deokho Kim, Changmin Lee, and Won W. Ro
- Workshop on Micro Architectural Support for Virtualization, Data Center Computing, and Clouds, in conjunction with MICRO 2010
- (MASVDC Workshop 2010) Atlanta, USA, Dec. 5, 2010
- Implementing FFT using SPMD style of OpenMP
[Link]
- Tien-Hsiung Weng, Sheng-Wei Huang, Won Woo Ro, and Kuan-Ching Li
- In Proc. of the 6th International Conference on Networked Computing and Advanced Information Management
- (NCM 2010) Seoul, Korea, Aug. 16 - 18, 2010
- Accelerated Reconstruction Using Parallel Computing for Spiral Spectroscopic Imaging
[Link]
- Dong H. Kim, Yun H. Oh, Yun H. Nam, M. Gu, and Won W. Ro
- In Proc. of 2010 International Society for Magnetic Resonance in Medicine Annual Meeting
- (2010 ISMRM Annual Meeting) Stockholm, Sweden, May 1 - 7, 2010
- FPGA Implementation of Highly Parallelized Decoder Logic for Network Coding
[Link]
- Sunwoo Kim and Won W. Ro
- In Proc. of Eighteenth ACM/SIGDA International Symposium on Field-Programmable Gate Arrays
- (FPGA 2010) Monterey, USA, Feb. 21 - 23, 2010
2009
Journal Papers
- A Complexity-Effective Microprocessor Design with Decoupled Dispatch Queues and Prefetching
[Link]
- Won W. Ro and Jean-Luc Gaudiot
- Parallel Computing, Vol. 35, No. 5, pp. 255-268, May 2009 (IF: 1.309, Q2, JCR2008)
Conference Papers
- Evaluation of Cache Coherence Protocols on Multi-Core Systems with Linear Workloads
[Link]
- Yong J. Jang and Won W. Ro
- In Proc. of 2009 International Colloquium on Computing, Communication, Control, and Management
- (CCCM 2009) Sanya, China, Aug. 8 - 9, 2009
- Comparing Open Source Web Services: gSoap and AXIS
[Link]
- Jongwook Woo and Won W. Ro
- In Proc. of the 24th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2009) Jeju Island, Korea, July 5 - 8, 2009
- Efficient Parallelized Network Coding for P2P File Sharing Applications
[Link]
- Karam Park, Joon-Sang Park, and Won W. Ro
- In Proc. of the 4th International Conference on Grid and Pervasive Computing
- (GPC 2009) Geneva, Switzerland, May 4 - 8, 2009
- Fully Pipelined Hardware Implementation of 128-bit SEED Block Cipher Algorithm
[Link]
- Jaeyoung Yi, Karam Park, Joonseok Park, and Won W. Ro
- In Proc. of the 5th International Workshop on Applied Reconfigurable Computing
- (ARC 2009) Karlsruhe, Germany, Mar. 16 - 18, 2009
Book Chapters
- Programmability and Scalability on Multi-Core Architectures
[Link]
- Jaeyoung Yi, Yong J. Jang, Doohwan Oh, and Won W. Ro
- Chapter in "Handbook of Research on Scalable Computing Technologies", edited by Kuan-Ching Li, Ching-Hsien Hsu, Laurence Tianruo Yang, Jack Dongarra, and Hans Zima, Information Science Reference, 2009
2008
Journal Papers
- Efficient Peer-to-Peer File Sharing Using Network Coding in MANET
[Link]
- Uichin Lee, Joon-Sang Park, Seung-Hoon Lee, Won W. Ro, Giovanni Pau, and Mario Gerla
- Journal of Communications and Networks, Vol. 10, No. 4, Dec. 2008 (IF: 0.223, Q4, JCR2007)
- A Low-Complexity Microprocessor Design with Speculative Pre-Execution
[Link]
- Won W. Ro and Jean-Luc Gaudiot
- Journal of Systems Architecture, Vol. 54, No. 12, pp. 1101-1112, Dec. 2008 (IF: 0.490, Q3, JCR2007)
- Performance Evaluation of Programming Models for SMP-Based Clusters
[Link]
- Myungho Lee, Neungsoo Park, Won W. Ro, and Kuan-Ching Li
- Journal of the Chinese Institute of Engineers, Vol. 31, No. 7, pp. 1181-1188, Dec. 2008 (IF: 0.183, Q4, JCR2007)
- Simultaneous Thin-Thread Processors for Low-Power Embedded Systems
[Link]
- Won W. Ro, Jaeyoung Yi, Joon-Sang Park, and Joonseok Park
- IEICE Electronics Express, Vol. 5, No. 19, pp. 802-808, Oct. 2008 (IF: 0.436, Q3, JCR2007)
- Delay Analysis of Car-to-Car Reliable Data Delivery Strategies Based on Data Mulling with Network Coding
[Link]
- Joon-Sang Park, Uichin Lee, Soon Young Oh, Mario Gerla, Desmond Siumen Lun, Won W. Ro, and Joonseok Park
- IEICE Transactions on Information and Systems, Vol. E91-D, No. 10, Oct. 2008 (IF: 0.245, Q4, JCR2007)
Conference Papers
- Parallel Algorithms for Steiner Tree Problem
[Link]
- Joon-Sang Park, Won W. Ro, Handuck Lee, and Neungsoo Park
- In Proc. of the 3rd International Conference on Convergence and Hybrid Information Technology
- (ICHIT 2008) Busan, Korea, Nov. 11 - 13, 2008
2006
Journal Papers
- Design and Evaluation of a Hierarchical Decoupled Architecture
[Link]
- Won W. Ro, Stephen P. Crago, Alvin M. Despain, and Jean-Luc Gaudiot
- Journal of Supercomputing, Springer, Vol. 38, No. 3, pp. 237-259, Dec. 2006 (IF: 0.482, Q3, JCR2005)
- Speculative Pre-Execution Assisted by Compiler (SPEAR)
[Link]
- Won W. Ro and Jean-Luc Gaudiot
- Journal of Parallel and Distributed Computing, Elsevier, Vol. 66, No. 8, pp. 1076-1089, Aug. 2006 (IF: 0.900, Q2, JCR2005)
Conference Papers
- Design and Effectiveness of Small-Sized Decoupled Dispatch Queues
[Link]
- Won W. Ro and Jean-Luc Gaudiot
- In Proc. of European Conference on Parallel Computing - LNCS
- (EURO-PAR 2006) Dresden, Germany, Aug. 29 - Sep. 1, 2006
2005
Conference Papers
- A Low-Complexity Issue Queue Design with Speculative Pre-Execution
[Link]
- Won W. Ro and Jean-Luc Gaudiot
- In Proc. of the 12th International Conference on High Performance Computing
- (HiPC 2005) Goa, India, Dec. 18 - 21, 2005
Book Chapters
- Techniques to Improve Performance Beyond Pipelining: Superpipelining, Superscalar, and VLIW
[Link]
- Jean-Luc Gaudiot, Jung-Yup Kang, and Won Woo Ro
- Chapter in "Computer Architecture", a volume of "Advances in Computers", edited by Ali R. Hurson, Elsevier, 2005
2004
Conference Papers
- SPEAR: A Hybrid Model for Speculative Pre-Execution
[Link]
- Won W. Ro and Jean-Luc Gaudiot
- In Proc. of the 18th International Parallel and Distributed Processing Symposium
- (IPDPS 2004) Santa Fe, New Mexico, 2004
2003
Conference Papers
- HiDISC: A Decoupled Architecture for Data-Intensive Applications
[Link]
- Won W. Ro, Jean-Luc Gaudiot, Stephen P. Crago, and Alvin M. Despain
- In Proc. of the 17th International Parallel and Distributed Processing Symposium
- (IPDPS 2003) Nice, France, Apr. 22 - 26, 2003
- Compiler Support for Dynamic Speculative Pre-Execution
[Link]
- Won W. Ro and Jean-Luc Gaudiot
- In Proc. of the 7th Annual Workshop on Interaction between Compilers and Computer Architectures
- (INTERACT-7) in conjunction with HPCA-9 Anaheim, California, Feb. 8, 2003
2000
Conference Papers
- Memory Latency: to Tolerate or to Reduce?
[Link]
- Amol Bakshi, Jean-Luc Gaudiot, Wen-Yen Lin, Manil Makhija, Viktor K. Prasanna, Wonwoo Ro, and Chulho Shin
- In Proc. of the 12th Symposium on Computer Architecture and High Performance Computing
- (SBAC-PAD'00) Sao Pedro, Brazil, Oct. 24 - 27, 2000
- A High-Performance, Hierarchical Decoupled Architecture
[Link]
- Stephen P. Crago, Alvin Despain, Jean-Luc Gaudiot, Manil Makhija, Wonwoo Ro, and Apoorv Srivastava
- In Proc. of the Memory access Decoupling for superscalar and multiple issue Architectures
- (MEDEA) Workshop in conjunction with PACT 2000 Philadelphia, Oct. 15, 2000
- A Reliable Cluster Computing with a New Checkpointing RAID-x Architecture
[Link]
- Kai Hwang, Hai Jin, Roy Ho, and Wonwoo Ro
- In Proc. of the 9th Heterogeneous Computing Workshop
- (HCW) Cancun, Mexico, May 1, 2000
All Publications
2025
In Press
Conference Papers
- Ditto: Accelerating Diffusion Model via Temporal Value Similarity
- Sungbin Kim*, Hyunwuk Lee*, Wonho Cho, Mincheol Park, and Won Woo Ro
- The 31st IEEE International Symposium on High-Performance Computer Architecture
- (HPCA 2025)
- Marching Page Walks: Batching and Concurrent Page Table Walks for Enhancing GPU Throughput
- Jiwon Lee, Gun Ko, Myung Kuk Yoon, Ipoom Jeong, Yunho Oh, and Won Woo Ro
- The 31st IEEE International Symposium on High-Performance Computer Architecture
- (HPCA 2025)
- Qubit Movement-Optimized Program Generation on Zoned Neutral Atom Processors
- Enhyeok Jang, Youngmin Kim, Hyungseok Kim, Seungwoo Choi, Yipeng Huang, and Won Woo Ro
- The IEEE/ACM International Symposium on Code Generation and Optimization
- (CGO 2025)
- PIMutation: Exploring the Potential of Real PIM Architecture for Quantum Circuit Simulation
- Dongin Lee, Enhyeok Jang, Seungwoo Choi, Junwoong An, Cheolhwan Kim, and Won Woo Ro
- The 30th Asia and South Pacific Design Automation Conference
- (ASP-DAC 2025)
2024
Journal Papers
- SHREG: Mitigating Register Redundancy in GPUs
- Seunghyun Jin, Hyunwuk Lee, Jonghyun Lee, Junsung Kim, and Won Woo Ro
- Journal of Systems Architecture, Vol. 152, July 2024
Conference Papers
- DEPrune: Depth-wise Separable Convolution Pruning for Maximizing GPU Parallelism
- Cheonjun Park, Mincheol Park, Hyunchan Moon, Myung Kuk Yoon, Seokjin Go, Suhyun Kim, and Won Woo Ro
- The 38th Annual Conference on Neural Information Processing Systems
- (NeurIPS 2024)
- AirGun: Adaptive Granularity Quantization for Accelerating Large Language Models
- Sungbin Kim, Hyunwuk Lee, Sungwoo Kim, Cheolhwan Kim, and Won Woo Ro
- The International Conference on Computer Design
- (ICCD 2024)
- MOSQ: Accelerating Classical Simulation of UCCSD Ansatz Circuits using Merged Operation
- Seungwoo Choi, Enhyeok Jang, Youngmin Kim, and Won Woo Ro
- The International Conference on Computer Design
- (ICCD 2024)
- Generalizing Ray Tracing Accelerators for Tree Traversals on GPUs
- Dongho Ha*, Lufei Liu*, Yuan Hsi Chou, Seokjin Go, Won Woo Ro, Hung-Wei Tseng, and Tor M. Aamodt
- The International Symposium on Microarchitecture
- (MICRO 2024)
- Barber: Balancing Thermal Relaxation Deviations of NISQ Programs by Exploiting Bit-Inverted Circuits
- Enhyeok Jang, Seungwoo Choi, Youngmin Kim, Jeewoo Seo, and Won Woo Ro
- The 2024 ACM/IEEE International Conference on Computer-Aided Design
- (ICCAD 2024)
- Recompiling QAOA Circuits on Various Rotational Directions
- Enhyeok Jang, Dongho Ha, Seungwoo Choi, Youngmin Kim, Jaewon Kwon, Yongju Lee, Sungwoo Ahn, Hyungseok Kim, and Won Woo Ro
- The 33rd International Conference on Parallel Architectures and Compilation Techniques
- (PACT 2024)
- M3XU: Achieving High-Precision and Complex Matrix Multiplication with Low-Precision MXUs
- Dongho Ha, Yunan Zhang, Chen-Chien Kao, Christopher J. Hughes, Won Woo Ro, and Hung-Wei Tseng
- The International Conference for High Performance Computing, Networking, Storage, and Analysis
- (SC 2024)
- GUMSO: Gating Unnecessary On-Chip Memory Slices for Power Optimization on GPUs
- Seunghyun Jin, Hyunwuk Lee, and Won Woo Ro
- The 2024 ACM/IEEE International Symposium on Low Power Electronics and Design
- (ISLPED 2024)
- Geneva: A Dynamic Confluence of Speculative Execution and In-Order Commitment Windows
- Yanghee Lee, Jiwon Lee, Jaewon Kwon, Yongju Lee, and Won Woo Ro
- The 61st Design Automation Conference
- (DAC 2024)
- Systolic Array Architecture Supporting Multiple Scaling Factors for U-Net Quantization
- Hyunwuk Lee and Won Woo Ro
- The 23rd International Conference on Electronics, Information, and Communication
- (ICEIC 2024)
- Evaluating Performance of Shared On-Chip Caches in Multi-GPUs
- Gun Ko and Won Woo Ro
- The 23rd International Conference on Electronics, Information, and Communication
- (ICEIC 2024)
- A Multi-DNN Acceleration Architecture for Balanced QoS and Throughput
- Ipoom Jeong, Sungji Choi, Minjae Kim, Enhyeok Jang, Seokjin Go, and Won Woo Ro
- The 23rd International Conference on Electronics, Information, and Communication
- (ICEIC 2024)
- Integrated Framework Design Methodologies to Support Processing-In-Memory Platforms
- Enhyeok Jang, Hongju Kal, Jaewon Kwon, and Won Woo Ro
- The 23rd International Conference on Electronics, Information, and Communication
- (ICEIC 2024)
- REPrune: Channel Pruning via Kernel Representative Selection
- Mincheol Park, Dongjin Kim, Cheonjun Park, Yuna Park, Gyeong Eun Gong, Won Woo Ro, and Suhyun Kim
- The 38th AAAI Conference on Artificial Intelligence
- (AAAI 2024)
2023
Journal Papers
- A Convertible Neural Processor Supporting Adaptive Quantization for Real-Time Neural Networks
- Hongju Kal, Hyoseong Choi, Ipoom Jeong, Joon-Sung Yang, and Won Woo Ro
- Journal of Systems Architecture, Vol. 145, Nov. 2023
Conference Papers
- INTERPRET: Inter-Warp Register Reuse for GPU Tensor Cores
- Jae Seok Kwak, Myung Kuk Yoon, Ipoom Jeong, Seunghyun Jin, and Won Woo Ro
- The 32nd International Conference on Parallel Architectures and Compilation Techniques
- (PACT 2023)
- McCore: A Holistic Management of High-Performance Heterogeneous Multicores
- Jaewon Kwon, Yongju Lee, Hongju Kal, Minjae Kim, Youngsok Kim, and Won Woo Ro
- The 56th International Symposium on Microarchitecture
- (MICRO 2023)
- AESPA: Asynchronous Execution Scheme to Exploit Bank-Level Parallelism of Processing-in-Memory
- Hongju Kal, Chanyoung Yoo, and Won Woo Ro
- The 56th International Symposium on Microarchitecture
- (MICRO 2023)
- MAD MAcce: Supporting Multiply-Add Operations for Democratizing Matrix-Multiplication Accelerator
- Seunghwan Sung, Sujin Hur, Dongho Ha, Sungwoo Kim, Yunho Oh, and Won Woo Ro
- The 56th International Symposium on Microarchitecture
- (MICRO 2023)
- Exploiting Inherent Properties of Complex Numbers for Accelerating Complex Valued Neural Networks
- Hyunwuk Lee, Hyungjun Jang, Sungbin Kim, Sungwoo Kim, Wonho Cho, and Won Woo Ro
- The 56th International Symposium on Microarchitecture
- (MICRO 2023)
- Performance Analysis of Criticality-Aware Out-of-Order Cores for Exploiting MLP
- Yanghee Lee, Jiwon Lee, and Won Woo Ro
- The 38th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2023)
- Adaptive Data Prefetcher with Probability Learning in LLC
- Jusin Kim, Jiwon Lee, and Won Woo Ro
- The 38th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2023)
- Context Swap: Multi-PIM System Preventing Remote Memory Access for Large Embedding Model Acceleration
- Hongju Kal, Cheolhwan Kim, Minjae Kim, and Won Woo Ro
- The 2023 IEEE International Conference on Artificial Intelligence Circuits and Systems
- (AICAS 2023)
- TensorCV: Accelerating Non-AI/ML Stages in Computing Vision Pipelines using Tensor Processors
- Dongho Ha, Won Woo Ro, and Hung-Wei Tseng
- The 2023 ACM/IEEE International Symposium on Low Power Electronics and Design
- (ISLPED 2023)
- R2D2: Removing ReDunDancy Utilizing Linearity of Address Generation in GPUs
- Dongho Ha, Yunho Oh, and Won Woo Ro
- The 50th International Symposium on Computer Architecture
- (ISCA 2023)
- Early-Adaptor: An Adaptive Framework for Proactive UVM Memory Management
- Seokjin Go, Hyunwuk Lee, Junsung Kim, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
- The 2023 IEEE International Symposium on Performance Analysis of Systems and Software
- (ISPASS 2023)
- Lightning Talk: Efficiency and Programmability of DNN Accelerators and GPUs
- Won Woo Ro
- The 60th ACM/IEEE Design Automation Conference
- (DAC 2023)
- Quixote: Improving Fidelity of Quantum Program by Independent Execution of Controlled Gates
- Enhyeok Jang, Seungwoo Choi, and Won Woo Ro
- The 60th ACM/IEEE Design Automation Conference
- (DAC 2023)
- Balanced Column-Wise Block Pruning for Maximizing GPU Parallelism
- Cheonjun Park, Mincheol Park, Hyun Jae Oh, Minkyu Kim, Myung Kuk Yoon, Suhyun Kim, and Won Woo Ro
- The 37th AAAI Conference on Artificial Intelligence
- (AAAI 2023)
- SnakeByte: A TLB Design with Adaptive and Recursive Page Merging in GPUs
- Jiwon Lee, Ju Min Lee, Yunho Oh, William J. Song, and Won Woo Ro
- The 29th IEEE International Symposium on High-Performance Computer Architecture
- (HPCA 2023)
- Analysis on Memory Access Patterns of Server-Class Workloads in Page- and Cache Line- Granularity
- Kyeonghoon Lim, Minjae Kim, Jiwon Lee, and Won Woo Ro
- The 22nd International Conference on Electronics, Information, and Communication
- (ICEIC 2023)
- Enabling Heterogeneous Memory System over CXL
- Dongin Lee, Sungbin Kim, Hyungjun Jang, Sungwoo Kim, and Won Woo Ro
- The 22nd International Conference on Electronics, Information, and Communication
- (ICEIC 2023)
- Investigation on NVIDIA Ampere GPU Architecture with Reverse Engineering
- Sujin Hur, Seunghwan Sung, Dongho Ha, Sungwoo Kim, and Won Woo Ro
- The 22nd International Conference on Electronics, Information, and Communication
- (ICEIC 2023)
2022
Journal Papers
- TEA-RC: Thread Context-Aware Register Cache for GPUs
- Ipoom Jeong, Yunho Oh, Won Woo Ro, and Myung Kuk Yoon
- Accepted to IEEE Access
- CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs
- Yunho Oh, Ipoom Jeong, Won Woo Ro, and Myung Kuk Yoon
- Accepted to IEEE Embedded Systems Letters
- (IEEE ESL)
- FLIXR: Embedding Index into Flash Translation Layer in SSDs
- Gunjae Koo, Yunho Oh, Hung-Wei Tseng, Won Woo Ro, and Murali Annavaram
- Accepted to IEEE Transactions on Computers
Conference Papers
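- Implementation of a BLAS-Based Framework for Utilizing Heterogeneous Processing-in-Memory Architectures (in Korean)
- Chanyoung Yoo, Enhyeok Jang, Hongju Kal, and Won Woo Ro
- Fall Conference of the Institute of Electronics and Information Engineers (IEIE)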
- Reconstructing Out-of-Order Issue Queue
- Ipoom Jeong, Jiwon Lee, Myung Kuk Yoon, and Won Woo Ro
- The 55th IEEE/ACM International Symposium on Microarchitecture
- (MICRO 2022)
- Analysis of SSD with Logical to Physical Address Mapping of Hot Data to Single Level Cell Area
- Gyuseok Choe, Youngmin Lee, and Won Woo Ro
- The 37th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2022)
- Analysis of DRAM-based Network of DRAM Swap Space Adopting Latency Hiding Technique
- Hyoseong Choi, Jiwon Lee, Jeonghoon Choi, and Won Woo Ro
- The 37th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2022)
- PR3D: Processing Recommendation Systems in 3D-Stacked DRAM Adopting Heterogeneous Data Format
- Chanyoung Yoo, Hongju Kal, and Won Woo Ro
- The 21st International Conference on Electronics, Information, and Communication
- (ICEIC 2022)
2021
Journal Papers
- Two-Stage In-Storage Processing and Scheduling for Pattern Matching Applications
- Joohyeong Yoon, Yoonjin Lee, Won Seob Jeong, and Won Woo Ro
- IEEE Access, Vol. 9, pp. 95702-95715, Jun. 2021
- PIMCaffe: Functional Evaluation of a Machine Learning Framework for In-Memory Neural Processing Unit
- Won Jeon, Jiwon Lee, Dongseok Kang, Hongju Kal, and Won Woo Ro
- IEEE Access, Vol. 9, pp. 96629-96640, Jul. 2021
Conference Papers
- SPACE: Locality-Aware Processing in Heterogeneous Memory for Personalized Recommendations
- Hongju Kal, Seokmin Lee, Gun Ko, and Won Woo Ro
- The 48th ACM/IEEE International Symposium on Computer Architecture
- (ISCA 2021)
- Analysis of GPU Scheduling Technique for Convergence Barrier
- Jae Seok Kwak and Won Woo Ro
- The 20th International Conference on Electronics, Information, and Communication
- (ICEIC 2021)
- Delay Analysis on Tensor Access Patterns of CNN Algorithms
- Jonathan Robert Malin and Won Woo Ro
- The 20th International Conference on Electronics, Information, and Communication
- (ICEIC 2021)
- Detecting Pattern of Warp Register Value Differences in CTA using GPU Compiler
- Dongho Ha and Won Woo Ro
- The 20th International Conference on Electronics, Information, and Communication
- (ICEIC 2021)
- Analysis of Multiple-Application Support Techniques in GPU
- Jonghyun Lee and Won Woo Ro
- The 6th International Conference On Consumer Electronics (ICCE) Asia
- (ICCE-ASIA 2021)
- Analysis of Key-Value SSD to Improve the Performance of Key-Value Store System
- Gyuseok Choe, Jeonghoon Choi, and Won Woo Ro
- The 6th International Conference On Consumer Electronics (ICCE) Asia
- (ICCE-ASIA 2021)
- QoS-Aware Scheduling for Cellular Networks Using Deep Reinforcement Learning
- Jonathan Robert Malin, Gun Ko, and Won Woo Ro
- The 18th IFIP International Conference on Network and Parallel Computing
- (NPC 2021)
2020
Journal Papers
- Hi-End: Hierarchical, Endurance-Aware STT-MRAM-Based Register File for Energy-Efficient GPUs
- Won Jeon, Jun Hyun Park, Yoonsoo Kim, Gunjae Koo, and Won Woo Ro
- IEEE Access, Vol. 8, pp. 127768-127780, Jul. 2020
- REACT: Scalable and High-Performance Regular Expression Pattern Matching Accelerator for In-Storage Processing
- Won Seob Jeong, Changmin Lee, Keunsoo Kim, Myung Kuk Yoon, Won Jeon, Myoungsoo Jung, and Won Woo Ro
- IEEE Transactions on Parallel and Distributed Systems, Vol. 31, Issue 5, pp. 1137-1151, May 2020
Conference Papers
- BENEFIT: Basic Linear Algebra Subprogram and Neural Network framework for FPGA-based Neural Processing Units
- Dongseok Kang and Won Woo Ro
- The Fifth International Conference On Consumer Electronics Asia
- (ICCE-ASIA 2020)
- Busan, Korea, Nov. 1 - 3, 2020
- OASIS: Overhead Analysis of Systolic Neural Processing Unit on LSTM
- Byunghwy Choi and Won Woo Ro
- The Fifth International Conference On Consumer Electronics Asia
- (ICCE-ASIA 2020)
- Busan, Korea, Nov. 1 - 3, 2020
- Interaction Data Analysis for Personalized Recommendation System
- Seokmin Lee and Won Woo Ro
- The Fifth International Conference On Consumer Electronics Asia
- (ICCE-ASIA 2020)
- Busan, Korea, Nov. 1 - 3, 2020
- BODCA: Heterogeneous CPU-GPU computing system with Bandwidth-Optimized DRAM cache design
- Sungji Choi and Won Woo Ro
- The Fifth International Conference On Consumer Electronics Asia
- (ICCE-ASIA 2020)
- Busan, Korea, Nov. 1 - 3, 2020
- Duplo: Lifting Redundant Memory Accesses of Neural Networks for GPU Tensor Cores
- Hyeonjin Kim, Sungwoo Ahn, Yunho Oh, Bogil Kim, Won Woo Ro, and William J. Song
- The 53rd IEEE/ACM International Symposium on Microarchitecture
- (MICRO 2020)
- Virtual Conference, Oct. 17 - 21, 2020
- Check-In: In-Storage Checkpointing for Key-Value Store System Leveraging Flash-Based SSDs
- Joohyeong Yoon, Won Seob Jeong, and Won Woo Ro
- The 47th ACM/IEEE International Symposium on Computer Architecture
- (ISCA 2020)
- Virtual Conference, May 29 - Jun. 3, 2020
- CASINO Core Microarchitecture: Generating Out-of-Order Schedules Using Cascaded In-Order Scheduling Windows
- Ipoom Jeong, Seihoon Park, Changmin Lee, and Won Woo Ro
- The 26th International IEEE Symposium on High Performance Computer Architecture
- (HPCA 2020)
- San Diego, CA, USA, Feb. 22 - 26, 2020
- Self-controllable refresh target row skip and inclusion technique for the intelligent DRAM
- Jaein Song and Won Woo Ro
- The 19th International Conference on Electronics, Information and Communication
- (ICEIC 2020)
- Access Characteristic-based Cache Replacement Policy in an SSD
- Joohyeong Yoon and Won Woo Ro
- The 19th International Conference on Electronics, Information and Communication
- (ICEIC 2020)
2019
Journal Papers
- OverCome: Coarse-Grained Instruction Commit with Handover Register Renaming
- Ipoom Jeong, Changmin Lee, Keunsoo Kim, and Won Woo Ro
- IEEE Transactions on Computers, Vol. 68, Issue 12, pp. 1802-1816, Dec. 2019
- Contents-Aware Partitioning Algorithm for Parallel High Efficiency Video Coding
- Kyungah Kim and Won Woo Ro
- Multimedia Tools and Applications, Vol. 78, Issue 9, pp. 11427-11442, May 2019
- Fast CU Depth Decision for HEVC using Neural Networks
- Kyungah Kim and Won Woo Ro
- IEEE Transactions on Circuits and Systems for Video Technology, Vol. 29, No. 5, pp. 1462-1473, May 2019
- Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs
- Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Murali Annavaram, and Won Woo Ro
- IEEE Transactions on Computers, Vol. 68, No. 4, pp. 609-616, Apr. 2019
Conference Papers
- Efficient Dilated-Winograd Convolutional Neural Networks
- Minsik Kim, Cheonjun Park, Sungjun Kim, Taeyoung Hong, and Won Woo Ro
- The 2019 IEEE International Conference on Image Processing, Accepted
- Performance Scalability Limit of PARSEC Benchmark on a Many-Core Processor
- Won Seob Jeong and Won Woo Ro
- The 34th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2019)
- Jeju, Korea, Jun. 23 - 26, 2019
- Analysis of SSD Internal DRAM Sensitivity for a Key-Value Store
- Yongseok Won, Yoonjin Lee, Won Seob Jeong, and Won Woo Ro
- The 34th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2019)
- Jeju, Korea, Jun. 23 - 26, 2019
- Exploiting GPU hierarchical TLB in Multi-Application Execution
- Hyun Jae Oh, Won Jeon, and Won Woo Ro
- The 34th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2019)
- Jeju, Korea, Jun. 23 - 26, 2019
- Hierarchical, Compressed STT-MRAM Register File for GPU
- Jun Hyun Park and Won Woo Ro
- The 34th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2019)
- Jeju, Korea, Jun. 23 - 26, 2019
- Linebacker: Preserving Victim Cache Lines in Idle Register Files of GPUs
- Yunho Oh, Gunjae Koo, Murali Annavaram, and Won Woo Ro
- The 46th ACM/IEEE International Symposium on Computer Architecture
- (ISCA 2019)
- Phoenix, Arizona, USA, Jun. 22 - 26, 2019
- Analysis of SSD Internal Cache Problem in a Key-Value Store System
- Won Seob Jeong, Yongseok Won, and Won Woo Ro
- The 2nd International Conference on Big Data and Smart Computing
- (ICBDSC 2019)
- Bali, Indonesia, Jan. 10 - 13, 2019
2018
Journal Papers
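- Trends in the Development of High-Performance Graphics Processing Units (in Korean)
- Dongho Ha, Hyunwuk Lee, Jiwon Lee, Hyun Jae Oh, Won Jeon, Yunho Oh, and Won Woo Ro
- Communications of the Korean Institute of Information Scientists and Engineers (KIISE)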
- WASP: Selective Data Prefetching with Monitoring Runtime Warp Progress on GPUs
- Yunho Oh, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, and Won Woo Ro
- IEEE Transactions on Computers, Vol. 67, No. 9, pp. 1366-1373, Sep. 2018
- Exploiting Pseudo-Quadtree Structure for Accelerating HEVC Spatial Resolution Downscaling Transcoder
- Minsik Kim, Minyong Sung, Minwoo Kim, and Won Woo Ro
- IEEE Transactions on Multimedia, Vol. 20, No. 9, pp. 2262-2275, Sep. 2018
- Architectural Protection of Application Privacy against Software and Physical Attacks in Untrusted Cloud Environment
- Lei Xu, JongHyuk Lee, Seung Hun Kim, Qingji Zheng, Shouhuai Xu, Taeweon Suh, Won Woo Ro, and Weidong Shi
- IEEE Transactions on Cloud Computing, Vol. 6, No. 2, pp. 478-491, Apr-Jun. 2018
- Simultaneous and Speculative Thread Migration for Improving Energy Efficiency of Heterogeneous Core Architectures
- Changmin Lee and Won Woo Ro
- IEEE Transactions on Computers, Vol. 67, No. 4, pp. 498-512, Apr. 2018
Conference Papers
- Region of Interest based Frame Rate Up-Conversion using Encoded Bit-stream
- Kyungah Kim and Won Woo Ro
- International Conference on Communication, Image and Signal Processing
- (CCISP 2018)
- Sanya, China, Nov. 16 - 18, 2018
- FineReg: Fine-Grained Register File Management for Augmenting GPU Throughput
- Yunho Oh, Myung Kuk Yoon, William J. Song, and Won Woo Ro
- The 51st IEEE/ACM International Symposium on Microarchitecture
- (MICRO 2018)
- Fukuoka, Japan, Oct. 20 - 24, 2018
- Fast Intra LCU Decision using Deep Neural Networks
- Kyungah Kim and Won Woo Ro
- The International Conference On Big data, IoT, and Cloud Computing
- (BIC-18)
- Jeju, Korea, Aug. 20 - 22, 2018
- Near-Data Processing Optimization for Efficient Neural Network Computations
- Sungwoo Ahn, Won Jeon, and Won Woo Ro
- The 3rd International Conference On Consumer Electronics Asia
- (ICCE-ASIA 2018)
- Jeju, Korea, Jun. 24 - 26, 2018
- Constructing Resilient Region in Dynamic Optimization Systems via Dynamic Adjustment of Bias Thresholds
- Ipoom Jeong and Won Woo Ro
- The 3rd International Conference On Consumer Electronics Asia
- (ICCE-ASIA 2018)
- Jeju, Korea, Jun. 24 - 26, 2018
- WIR: Warp Instruction Reuse to Minimize Repeated Computations in GPUs
- Keunsoo Kim and Won Woo Ro
- The 24th International IEEE Symposium on High Performance Computer Architecture
- (HPCA 2018)
- Vienna, Austria, Feb. 24 - 28, 2018
- Efficient and Reliable NAND Flash Channel for High-Speed Solid State Drives
- Joohyeong Yoon, Won Seob Jeong, Won Jeon, and Won Woo Ro
- The 17th International Conference on Electronics, Information and Communication
- (ICEIC 2018)
- Honolulu, HI, USA, Jan. 24 - 27, 2018
- Fast Robot Software Framework with Object-Oriented Design
- Heekuk Lee, Keunsoo Kim, and Won Woo Ro
- The 17th International Conference on Electronics, Information and Communication
- (ICEIC 2018)
- Honolulu, HI, USA, Jan. 24 - 27, 2018
2017
Journal Papers
- Dynamic Resizing on Active Warps Scheduler to Hide Operation Stalls on GPUs
- Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
- IEEE Transactions on Parallel and Distributed Systems, Vol. 28, No. 11, pp. 3142-3156, Nov. 2017
- Dynamic Load Balancing of Dispatch Scheduling for Solid State Disks
- Myunghyun Jo and Won Woo Ro
- IEEE Transactions on Computers, Vol. 66, No. 6, pp. 1034-1047, Jun. 2017
- Improving Energy Efficiency of GPUs through Data Compression and Compressed Execution
- Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Murali Annavaram, and Won Woo Ro
- IEEE Transactions on Computers, Vol. 66, No. 5, pp. 834-847, May 2017
Conference Papers
- Parallel In-Order Execution Architecture for Low-Power Processor
- Kyungmin Lee, Ipoom Jeong, and Won Woo Ro
- The 14th International SoC Design Conference
- (ISOCC 2017)
- Seoul, Korea, Nov. 5 - 8, 2017
- Characterizing Convolutional Neural Network Workloads on a Detailed GPU Simulator
- Kwanghee Chang, Minsik Kim, Kyungah Kim, and Won Woo Ro
- The 14th International SoC Design Conference
- (ISOCC 2017)
- Seoul, Korea, Nov. 5 - 8, 2017
- Access Pattern-Aware Cache Management for Improving Data Utilization in GPU
- Gunjae Koo, Yunho Oh, Won Woo Ro, and Murali Annavaram
- The 44th ACM/IEEE International Symposium on Computer Architecture
- (ISCA 2017)
- Toronto, Canada, Jun. 24 - 28, 2017
- Dynamic Warp Scheduler Selection Policy Using Linear Regression for GPUs
- Hyunjune Shin, Kyungmin Lee, Ipoom Jeong, Jong Hyun Park, and Won Woo Ro
- The 16th International Conference on Electronics, Information and Communication
- (ICEIC 2017)
- Phuket, Thailand, Jan. 11 - 14, 2017
- Exploiting L2 Cache Sensitivity in Artificial Neural Network on GPUs
- Seihoon Park, Yoonsoo Kim, Minsik Kim, and Won Woo Ro
- The 16th International Conference on Electronics, Information and Communication
- (ICEIC 2017)
- Phuket, Thailand, Jan. 11 - 14, 2017
- Optimizing Intersection and Reflection Step of Geometrical Optics using GPUs
- Hyun Jin Chung, Myung Kuk Yoon, and Won Woo Ro
- The 16th International Conference on Electronics, Information and Communication
- (ICEIC 2017)
- Phuket, Thailand, Jan. 11 - 14, 2017
- Analysis of Error Tolerance in Convolution Neural Networks
- Sangheon Kwon, Jong Hyun Park, and Won Woo Ro
- The 16th International Conference on Electronics, Information and Communication
- (ICEIC 2017)
- Phuket, Thailand, Jan. 11 - 14, 2017
2016
Journal Papers
- Server Side, Play Buffer Based Quality Control for Adaptive Media Streaming
- Keunsoo Kim, Benjamin Y. Cho, and Won Woo Ro
- Multimedia Tools and Applications, Vol. 75, No. 10, pp. 5397-5415, May 2016
- Exploiting Thread-Level Parallelism on HEVC by Employing Reference Dependency Graph
- Minwoo Kim, Deokho Kim, Kyungah Kim, and Won Woo Ro
- IEEE Transactions on Circuits and Systems for Video Technology, Vol. 26, No. 4, pp. 736-749, Apr. 2016
- Parallel GPU Architecture Simulation Framework Exploiting Architectural-Level Parallelism with Timing Error Prediction
- Sangpil Lee and Won Woo Ro
- IEEE Transactions on Computers, Vol. 65, No. 4, pp. 1253-1265, Apr. 2016
Conference Papers
- Measuring Error-Tolerance in SRAM Architecture on Hardware Accelerated Neural Network
- Sangheon Kwon, Kyungmin Lee, Yoonsoo Kim, Kyungah Kim, Changmin Lee, and Won Woo Ro
- The 1st IEEE International Conference on Consumer Electronics Asia
- (ICCE-ASIA 2016)
- Seoul, Korea, Oct. 26 - 28, 2016
- Virtual Thread: Maximizing Thread-Level Parallelism beyond GPU Scheduling Limit
- Myung Kuk Yoon, Keunsoo Kim, Sangpil Lee, Won Woo Ro, and Murali Annavaram
- The 43rd ACM/IEEE International Symposium on Computer Architecture
- (ISCA 2016)
- Seoul, Korea, Jun. 18 - 22, 2016
- APRES: Improving Cache Efficiency by Exploiting Load Characteristics on GPUs
- Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Won Woo Ro, and Murali Annavaram
- The 43rd ACM/IEEE International Symposium on Computer Architecture
- (ISCA 2016)
- Seoul, Korea, Jun. 18 - 22, 2016
- Warped-Slicer: Efficient Intra-SM Slicing through Dynamic Resource Partitioning for GPU Multiprogramming
- Qiumin Xu, Hyeran Jeon, Keunsoo Kim, Won Woo Ro, and Murali Annavaram
- The 43rd ACM/IEEE International Symposium on Computer Architecture
- (ISCA 2016)
- Seoul, Korea, Jun. 18 - 22, 2016
- Warped-Preexecution: A GPU Pre-execution Approach for Improving Latency Hiding
- Keunsoo Kim, Sangpil Lee, Myung Kuk Yoon, Gunjae Koo, Won Woo Ro, and Murali Annavaram
- The 22nd International IEEE Symposium on High Performance Computer Architecture
- (HPCA 2016)
- Barcelona, Spain, Mar. 12 - 16, 2016
- Accelerating Forwarding Computation of ANN using CUDA
- Jong Hyun Park and Won Woo Ro
- The 15th International Conference on Electronics, Information and Communication
- (ICEIC 2016)
- Danang, Vietnam, Jan. 27 - 30, 2016
- Fairness-Aware Thread Scheduling for Multithreaded Program using Intel Software Guarded Extensions
- Won Jeon, Seung Hun Kim, and Won Woo Ro
- The 15th International Conference on Electronics, Information and Communication
- (ICEIC 2016)
- Danang, Vietnam, Jan. 27 - 30, 2016
2015
Journal Papers
- A Performance-Energy Model to Evaluate Single Thread Execution Acceleration
- Seung Hun Kim, Dohoon Kim, Changmin Lee, Won Seob Jeong, Won Woo Ro, and Jean-Luc Gaudiot
- IEEE Computer Architecture Letters, Vol. 14, No. 2, pp. 99-102, Dec. 2015
- Dynamic Load Balancing of Parallel SURF with Vertical Partitioning
- Deokho Kim, Minwoo Kim, Kyungah Kim, Minyong Sung, and Won Woo Ro
- IEEE Transactions on Parallel and Distributed Systems, Vol. 26, No. 12, pp. 3358-3370, Dec. 2015
- Network Variation and Fault Tolerant Performance Acceleration in Mobile Devices with Simultaneous Remote Execution
- Keunsoo Kim, Benjamin Y. Cho, Won Woo Ro, and Jean-Luc Gaudiot
- IEEE Transactions on Computers, Vol. 64, No. 10, pp. 2862-2874, Oct. 2015
- Highly Secure Mobile Devices Assisted with Trusted Cloud Computing Environments
- Doohwan Oh, Ilkyu Kim, Keunsoo Kim, Sang-Min Lee, and Won Woo Ro
- ETRI Journal, Vol. 37, No. 2, pp. 348-358, Apr. 2015
Conference Papers
- True Motion Compensation With Feature Detection for Frame Rate Up-Conversion
- Kyungah Kim, Minwoo Kim, Deokho Kim, and Won Woo Ro
- The 2015 IEEE International Conference on Image Processing
- (ICIP 2015)
- Quebec City, Canada, Sep. 27 - 30, 2015
- An Accelerated Separable Median Filter with Sorting Networks
- Minsik Kim, Deokho Kim, Minyong Sung, and Won Woo Ro
- The 2015 IEEE International Conference on Image Processing
- (ICIP 2015)
- Quebec City, Canada, Sep. 27 - 30, 2015
- Contention-Free Fair Queuing for High-Speed Storage with RAID-0 Architecture
- Myung Hyun Jo and Won Woo Ro
- The 17th IEEE International Conference on High Performance Computing and Communications
- (HPCC 2015)
- New York, USA, Aug. 24 - 26, 2015
- Integrity Protection for Big Data Processing with Dynamic Redundancy Computation
- Zhimin Gao, Nicholas DeSalvo, Pham Dang Khoa, Seung Hun Kim, Lei Xu, Won Woo Ro, Rakesh M. Verma, and Weidong Shi
- The 2015 IEEE International Conference on Autonomic Computing
- (ICAC 2015)
- Grenoble, France, July 7 - 10, 2015
- Improving Pipeline Utilization with Two-Level Instruction Issue on GPUs
- Yunho Oh, Jong Hyun Park, and Won Woo Ro
- The 30th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2015)
- Seoul, Korea, Jun. 29 - July 2, 2015
- Accelerating ELMs on the GPU Toward Real-Time Training on Large Scale Data Sets
- Han Kyul Kim, Jong Hyun Park, and Won Woo Ro
- The 30th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2015)
- Seoul, Korea, Jun. 29 - July 2, 2015
- A Frequency Scaling Model for Energy Efficient DVFS Designs based on Circuit Delay Optimization
- Ki Bum Chun, Changmin Lee, and Won Woo Ro
- The 19th IEEE International Symposium on Consumer Electronics
- (ISCE 2015)
- UPM, Madrid, Spain, Jun. 24 - 26, 2015
- Another Look at Secure Big Data Processing: a Formal Framework and a Practical Approach
- Lei Xu, Seung Hun Kim, Won Woo Ro, and Weidong Shi
- The 8th IEEE International Conference on Cloud Computing
- (Cloud'15, Application Track)
- New York, USA, Jun. 27 - July 2, 2015
- Enhancing Software Dependability and Security with Hardware Supported Instruction Address Space Randomization
- Seung Hun Kim, Lei Xu, Ziyi Liu, Zhiqiang Lin, Won Woo Ro, and Weidong Shi
- The 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks
- (DSN 2015)
- Rio de Janeiro, Brazil, Jun. 22 - 25, 2015
- Warped-Compression: Enabling Power Efficient GPUs through Register Compression
- Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Won Woo Ro, and Murali Annavaram
- The 42nd ACM/IEEE International Symposium on Computer Architecture
- (ISCA 2015)
- Portland, OR, USA, Jun. 13 - 17, 2015
- DRAW: Investigating Benefits of Adaptive Fetch Group Size on GPU
- Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
- The 2015 IEEE International Symposium on Performance Analysis of Systems and Software
- (ISPASS 2015)
- Philadelphia, PA, USA, Mar. 29 - 31, 2015
2014
Journal Papers
- A Malicious Pattern Detection Engine for Embedded Security Systems in Internet of Things
- Doohwan Oh, Deokho Kim, and Won Woo Ro
- Sensors, Vol. 14, No. 12, pp. 24188-24211, Dec. 2014
- C-Lock: Energy Efficient Synchronization for Embedded Multicore Systems
- Seung Hun Kim, Sang Hyong Lee, Minje Jun, Byunghoon Lee, Won Woo Ro, Eui-Young Chung, and Jean-Luc Gaudiot
- IEEE Transactions on Computers, Vol. 63, No. 8, pp. 1962-1974, Aug. 2014
- Swarm Processor System: Hardware Process Scheduler based Energy Efficient Multi-Core System
- Won Seob Jeong, Seung Hun Kim, Sang-Min Lee, and Won Woo Ro
- IEICE Electronics Express, Vol. 11, No. 14, pp. 20140424, July 2014
- Complexity-Effective Contention Management with Dynamic Backoff for Transactional Memory Systems
- Seung Hun Kim, Dongmin Choi, Won Woo Ro, and Jean-Luc Gaudiot
- IEEE Transactions on Computers, Vol. 63, No. 7, pp. 1696-1708, July 2014
- Architectural Investigation of Matrix Data Layout on Multicore Processors
- Minwoo Kim and Won Woo Ro
- Future Generation Computer Systems, Vol. 37, pp. 64-75, July 2014
- Exploiting Implementation Diversity and Partial Connection of Routers in Application-Specific Network-on-Chip Topology Synthesis
- Minje Jun, Won Woo Ro, and Eui-Young Chung
- IEEE Transactions on Computers, Vol. 63, No. 6, pp. 1434-1445, Jun. 2014
- Accelerating MapReduce Framework on Multi-GPU Systems
- Hai Jiang, Yi Chen, Zhi Qiao, Kuan-Ching Li, Won Woo Ro, and Jean-Luc Gaudiot
- Cluster Computing, Vol. 17, No. 2, pp. 293-301, Jun. 2014
- Boosting CUDA Applications with CPU-GPU Hybrid Computing
- Changmin Lee, Won Woo Ro, and Jean-Luc Gaudiot
- International Journal of Parallel Programming, Vol. 42, No. 2, pp. 384-404, Apr. 2014
- This is an extension of our INTERACT-16 paper, which was selected as one of the best papers and recommended to IJPP.
Conference Papers
- LUT based Secure Cloud Computing - an Implementation using FPGAs
- Lei Xu, Pham Dang Khoa, Seung Hun Kim, Won Woo Ro, and Weidong Shi
- 2014 International Conference on ReConFigurable Computing and FPGAs
- (ReConFig 2014)
- Cancun, Mexico, Dec. 7 - 10, 2014
- Workload Synthesis: Generating Benchmark Workloads from Statistical Execution Profile
- Keunsoo Kim, Changmin Lee, Jung Ho Jung, and Won Woo Ro
- IEEE International Symposium on Workload Characterization
- (IISWC 2014)
- Raleigh, North Carolina, USA, Oct. 26 - 28, 2014
- Accelerating Gesture Recognition Algorithm Using Coarse Grained Reconfigurable Architectures
- Minsik Kim, Deokho Kim, Minyong Sung, Wonjae Lee, Jaehyun Kim, and Won Woo Ro
- The 4th International Conference on Audio, Language and Image Processing
- (ICALIP 2014)
- Shanghai, China, July 7 - 9, 2014
- A Micro-benchmark Suite to Understand Micro-Architectural Differences between Processors
- Changmin Lee, Keunsoo Kim, Jung Ho Jung, and Won Woo Ro
- The 29th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2014)
- Phuket, Thailand, July 1 - 4, 2014
- Maximizing DRAM Performance using Selective Operating Frequency Boosting
- Jung Ho Jung, Seung Hun Kim, Changmin Lee, and Won Woo Ro
- The 18th International Symposium on Consumer Electronics
- (ISCE 2014)
- Jeju, Korea, Jun. 22 - 25, 2014
- Workload and Variation Aware Thread Scheduling for Heterogeneous Multi-processor
- Seungwon Lee and Won Woo Ro
- The 18th International Symposium on Consumer Electronics
- (ISCE 2014)
- Jeju, Korea, Jun. 22 - 25, 2014
- Best paper award, Bronze prize
- DPM: Data Partitioning Method for Pipelined MapReduce on GPU
- Myung Hyun Jo and Won Woo Ro
- The 18th International Symposium on Consumer Electronics
- (ISCE 2014)
- Jeju, Korea, Jun. 22 - 25, 2014
- Accelerating HEVC Transcoder by Exploiting Decoded Quadtree
- Minyong Sung, Minwoo Kim, Minsik Kim, and Won Woo Ro
- The 18th International Symposium on Consumer Electronics
- (ISCE 2014)
- Jeju, Korea, Jun. 22 - 25, 2014
- Multicore Speedup Models using Frequency Scaling with Fixed Power Budget
- Seungwon Lee, Seung Hun Kim, and Won Woo Ro
- The 13th International Conference on Electronics, Information and Communication
- (ICEIC 2014)
- Kota Kinabalu, Malaysia, Jan. 15 - 18, 2014
- Hyper Threading-aware Virtual Machine Migration
- Chungmu Oh and Won Woo Ro
- The 13th International Conference on Electronics, Information and Communication
- (ICEIC 2014)
- Kota Kinabalu, Malaysia, Jan. 15 - 18, 2014
- Development of Efficient VCPU Pinning Mechanism in Xen
- Kyung Yoon Min, Seung Hun Kim, and Won Woo Ro
- The 13th International Conference on Electronics, Information and Communication
- (ICEIC 2014)
- Kota Kinabalu, Malaysia, Jan. 15 - 18, 2014
2013
Journal Papers
- Parallelized Sub-Resource Loading for Web Rendering Engine
- Deokho Kim, Changmin Lee, Sangpil Lee, and Won Woo Ro
- Journal of Systems Architecture, Vol. 59, No. 9, pp. 785-793, Oct. 2013
- Design and Evaluation of Random Linear Network Coding Accelerators on FPGAs
- Sunwoo Kim, Won Seob Jeong, Won Woo Ro, and Jean-Luc Gaudiot
- ACM Transactions on Embedded Computing Systems, Vol. 13, No. 1, pp. 1-24, Aug. 2013
- GPU-Friendly Parallel Genome Matching with Tiled Access and Reduced State Transition Table
- Yunho Oh, Doohwan Oh, and Won Woo Ro
- International Journal of Parallel Programming, Vol. 41, No. 4, pp. 526-551, Aug. 2013
- A Distributed Signature Detection Method for Detecting Intrusions in Sensor Systems
- Ilkyu Kim, Doohwan Oh, Myung Kuk Yoon, Kyueun Yi, and Won Woo Ro
- Sensors, Vol. 13, No. 4, pp. 3998-4016, Mar. 2013
- Exploiting SIMD Parallelism on Dynamically Partitioned Parallel Network Coding for P2P Systems
- Deokho Kim, Karam Park, and Won Woo Ro
- Computers & Electrical Engineering, Vol. 39, No. 1, pp. 55-56, Jan. 2013
- Benefits of Using Parallelized Non-Progressive Network Coding
- Minwoo Kim, Karam Park, and Won Woo Ro
- Journal of Network and Computer Applications, Vol. 36, No. 1, pp. 293-305, Jan. 2013
- Importance of Coherence Protocols with Network Applications on Multi-Core Processors
- Kyueun Yi, Won Woo Ro, and Jean-Luc Gaudiot
- IEEE Transactions on Computers, Vol. 62, No. 1, pp. 6-15, Jan. 2013
Conference Papers
- Efficient Descriptor-Filtering Algorithm for Speeded Up Robust Features Matching
- Minwoo Kim, Deokho Kim, Kyungah Kim, and Won Woo Ro
- The 5th FTRA International Conference on Computer Science and its Applications
- (CSA-13)
- Danang, Vietnam, Dec. 18 - 21, 2013
- XSD: Accelerating MapReduce by Harnessing the GPU inside an SSD
- Benjamin Y. Cho, Won Seob Jeong, Doohwan Oh, and Won Woo Ro
- The 1st Workshop on Near-Data Processing, in conjunction with MICRO-46
- (WoNDP 2013)
- Davis, USA, Dec. 8, 2013
- Mark-Sharing: A Parallel Garbage Collection Algorithm for Low Synchronization Overhead
- Hyunkyu Park, Changmin Lee, Seung Hun Kim, Won Woo Ro, and Jean-Luc Gaudiot
- The 19th IEEE International Conference on Parallel and Distributed Systems
- (ICPADS 2013)
- Seoul, Korea, Dec. 15 - 18, 2013
- Leveraging Effectiveness of Contention Management for Transactional Memory Systems with Performance Monitoring
- Keunsoo Kim, Seung Hun Kim, Sang-min Lee, and Won Woo Ro
- The 28th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2013)
- Yeosu, Korea, Jun. 30 - July 3, 2013
- MGMR: Multi-GPU Based MapReduce
- Yi Chen, Zhi Qiao, Hai Jiang, Kuan-Ching Li, and Won Woo Ro
- The 8th International Conference on Grid and Pervasive Computing
- (GPC 2013)
- Seoul, Korea, May 9 - 11, 2013
- Parallel GPU Architecture Simulation Framework Exploiting Work Allocation Unit Parallelism
- Sangpil Lee and Won Woo Ro
- The 2013 IEEE International Symposium on Performance Analysis of Systems and Software
- (ISPASS 2013)
- Austin, TX, USA, Apr. 21 - 23, 2013
- Directory Centralized Ring-based Interconnection for Multi-Core Systems
- Myung Kuk Yoon, Sangpil Lee, Deokho Kim, and Won Woo Ro
- The 12th International Conference on Electronics, Information and Communication
- (ICEIC 2013)
- Bali, Indonesia, Jan. 30 - Feb. 2, 2013
- Parallel Garbage Collection with Transactional Memory
- Hyunkyu Park, Changmin Lee, and Won Woo Ro
- The 12th International Conference on Electronics, Information and Communication
- (ICEIC 2013)
- Bali, Indonesia, Jan. 30 - Feb. 2, 2013
2012
Journal Papers
- Multi-Threading and Suffix Grouping on Massive Multiple Pattern Matching Algorithm
- Doohwan Oh and Won Woo Ro
- The Computer Journal, Vol. 55, No. 11, pp. 1331-1346, Nov. 2012
- Offloading of Media Transcoding for High-Quality Multimedia Services
- Seung Hun Kim, Keunsoo Kim, Changmin Lee, and Won Woo Ro
- IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, pp. 691-699, May 2012
- Design of a Power-Efficient Parallel Pipelined Bloom Filter
- Deokho Kim, Doohwan Oh, and Won Woo Ro
- Electronics Letters, Vol. 48, No. 7, pp. 367-369, Mar. 2012
- Reconfigurable and Parallelized Network Coding Decoder for VANETs
- Sunwoo Kim and Won Woo Ro
- Mobile Information Systems, Vol. 8, No. 1, pp. 45-59, Feb. 2012
- Accelerated Network Coding with Dynamic Stream Decomposition on Graphics Processing Unit
- Sangpil Lee and Won Woo Ro
- The Computer Journal, Vol. 55, No. 1, pp. 21-34, Jan. 2012
Conference Papers
- On Migration and Consolidation of VMs in Hybrid CPU-GPU Environments
- Kuan-Ching Li, Keunsoo Kim, Won Woo Ro, Tien-Hsiung Weng, Che-Lun Hung, Chen-Hao Ku, Albert Cohen, and Jean-Luc Gaudiot
- International Conference on Intelligent Technologies and Engineering Systems
- (ICITES 2012) - LNEE
- Changhua, Taiwan, Dec. 13 - 15, 2012
- Conflict Avoidance Scheduling using Grouping List for Transactional Memory
- Dongmin Choi, Seung Hun Kim, and Won Woo Ro
- The 17th International Workshop on High-Level Parallel Programming Models and Supportive Environments
- (HIPS-17)
- Shanghai, China, May 21, 2012
- Cooperative Heterogeneous Computing for Parallel Processing on CPU/GPU Hybrids
- Changmin Lee, Won Woo Ro, and Jean-Luc Gaudiot
- The 16th Workshop on Interaction between Compilers and Computer Architectures
- (INTERACT-16)
- New Orleans, USA, Feb. 25 - 29, 2012
- Matrix Data Layout Optimization for Multi-Core Architectures
- Minwoo Kim and Won Woo Ro
- The 11th International Conference on Electronics, Information and Communication
- (ICEIC 2012)
- Jeongseon, Korea, Feb. 1 - 3, 2012
- The Effect of Concurrency Control in Transactional Memory Systems
- Seung Hun Kim, Dongmin Choi, and Won Woo Ro
- The 11th International Conference on Electronics, Information and Communication
- (ICEIC 2012)
- Jeongseon, Korea, Feb. 1 - 3, 2012
- Adaptive Replacement Cache in Transactional Memory
- Dongmin Choi, Hyunkyu Park, Seung Hun Kim, and Won Woo Ro
- The 11th International Conference on Electronics, Information and Communication
- (ICEIC 2012)
- Jeongseon, Korea, Feb. 1 - 3, 2012
2011
Journal Papers
- A Novel Sequential Tree Algorithm Based on Scoreboard for MPI Broadcast Communication
- Won-young Chung, Jae-won Park, Seung-Woo Lee, Won Woo Ro, and Yong-surk Lee
- IEICE Transactions on Information and Systems, Vol. 94, No. 12, pp. 2523-2527, Dec. 2011
- Network Coding on Heterogeneous Multi-Core Processors for Wireless Sensor Networks
- Deokho Kim, Karam Park, and Won W. Ro
- Sensors, Vol. 11, No. 8, pp. 7908-7933, Aug. 2011
- A Low-Cost Standard Mode MPI Hardware Unit for Embedded MPSoC
- Won-Young Chung, Ha-Young Jeong, Won W. Ro, and Yong-Surk Lee
- IEICE Transactions on Information and Systems, Vol. E94-D, No. 7, pp. 1497-1501, July 2011
Conference Papers
- Parallel Transpose of Matrix Multiplication Based on the Tiling Algorithm
- Minwoo Kim, Yong J. Jang, and Won W. Ro
- The 54th IEEE International Midwest Symposium on Circuits and Systems
- (MWSCAS 2011)
- Seoul, Korea, Aug. 7 - 10, 2011
- Performance Evaluation of Adaptive Progressive Network Coding
- Deokho Kim, Karam Park, and Won W. Ro
- The 54th IEEE International Midwest Symposium on Circuits and Systems
- (MWSCAS 2011)
- Seoul, Korea, Aug. 7 - 10, 2011
2010
Journal Papers
- Multithreaded Pattern Matching Algorithm with Data Rearrangement
- Doohwan Oh, Seung Hun Kim, and Won W. Ro
- IEICE Electronics Express, Vol. 7, No. 20, pp. 1520-1526, Oct. 2010
- On Improving Parallelized Network Coding with Dynamic Partitioning
- Karam Park, Joon-Sang Park, and Won W. Ro
- IEEE Transactions on Parallel and Distributed Systems, Vol. 21, No. 11, pp. 1547-1560, Nov. 2010
- Hardware Implementation of a Tessellation Accelerator for the OpenVG Standard
- Seung Hun Kim, Yunho Oh, Karam Park, and Won W. Ro
- IEICE Electronics Express, Vol. 7, No. 6, pp. 440-446, Mar. 2010
Conference Papers
- Development of Virtual CUDA Systems of Parallel Processing on CPU and GPGPU
- Doohwan Oh, Sangpil Lee, Deokho Kim, Changmin Lee, and Won W. Ro
- Workshop on Micro Architectural Support for Virtualization, Data Center Computing, and Clouds, in conjunction with MICRO 2010
- (MASVDC Workshop 2010)
- Atlanta, USA, Dec. 5, 2010
- Implementing FFT using SPMD style of OpenMP
- Tien-Hsiung Weng, Sheng-Wei Huang, Won Woo Ro, and Kuan-Ching Li
- In Proc. of the 6th International Conference on Networked Computing and Advanced Information Management
- (NCM 2010)
- Seoul, Korea, Aug. 16 - 18, 2010
- Multi-Threaded Filtered BackProjection Algorithm on Multi-Core Processors
- Yun H. Oh and Won W. Ro
- The 10th International Conference on Electronics, Information, and Communication
- (ICEIC 2010)
- Cebu, Philippines, Jun. 30 - July 2, 2010
- Accelerated Reconstruction Using Parallel Computing for Spiral Spectroscopic Imaging
- Dong H. Kim, Yun H. Oh, Yun H. Nam, M. Gu, and Won W. Ro
- In Proc. of 2010 International Society for Magnetic Resonance in Medicine Annual Meeting
- (2010 ISMRM Annual Meeting)
- Stockholm, Sweden, May 1 - 7, 2010
- FPGA Implementation of Highly Parallelized Decoder Logic for Network Coding
- Sunwoo Kim and Won W. Ro
- In Proc. of Eighteenth ACM/SIGDA International Symposium on Field-Programmable Gate Arrays
- (FPGA 2010)
- Monterey, USA, Feb. 21 - 23, 2010
2009
Journal Papers
- A Complexity-Effective Microprocessor Design with Decoupled Dispatch Queues and Prefetching
- Won W. Ro and Jean-Luc Gaudiot
- Parallel Computing, Vol. 35, No. 5, pp. 255-268, May 2009
Conference Papers
- Evaluation of Cache Coherence Protocols on Multi-Core Systems with Linear Workloads
- Yong J. Jang and Won W. Ro
- In Proc. of 2009 International Colloquium on Computing, Communication, Control, and Management
- (CCCM 2009)
- Sanya, China, Aug. 8 - 9, 2009
- Comparing Open Source Web Services: gSoap and AXIS
- Jongwook Woo and Won W. Ro
- In Proc. of the 24th International Technical Conference on Circuits/Systems, Computers and Communications
- (ITC-CSCC 2009)
- Jeju Island, Korea, July 5 - 8, 2009
- Efficient Parallelized Network Coding for P2P File Sharing Applications
- Karam Park, Joon-Sang Park, and Won W. Ro
- In Proc. of the 4th International Conference on Grid and Pervasive Computing
- (GPC 2009)
- Geneva, Switzerland, May 4 - 8, 2009
- Fully Pipelined Hardware Implementation of 128-bit SEED Block Cipher Algorithm
- Jaeyoung Yi, Karam Park, Joonseok Park, and Won W. Ro
- In Proc. of the 5th International Workshop on Applied Reconfigurable Computing
- (ARC 2009)
- Karlsruhe, Germany, Mar. 16 - 18, 2009
Book Chapters
- Programmability and Scalability on Multi-Core Architectures
- Jaeyoung Yi, Yong J. Jang, Doohwan Oh, and Won W. Ro
- Chapter in "Handbook of Research on Scalable Computing Technologies", edited by Kuan-Ching Li, Ching-Hsien Hsu, Laurence Tianruo Yang, Jack Dongarra, and Hans Zima, Information Science Reference, 2009
2008
Journal Papers
- Efficient Peer-to-Peer File Sharing Using Network Coding in MANET
- Uichin Lee, Joon-Sang Park, Seung-Hoon Lee, Won W. Ro, Giovanni Pau, and Mario Gerla
- Journal of Communications and Networks, Vol. 10, No. 4, Dec. 2008
- A Low-Complexity Microprocessor Design with Speculative Pre-Execution
- Won W. Ro and Jean-Luc Gaudiot
- Journal of Systems Architecture, Vol. 54, No. 12, pp. 1101-1112, Dec. 2008
- Performance Evaluation of Programming Models for SMP-Based Clusters
- Myungho Lee, Neungsoo Park, Won W. Ro, and Kuan-Ching Li
- Journal of the Chinese Institute of Engineers, Vol. 31, No. 7, pp. 1181-1188, Dec. 2008
- Simultaneous Thin-Thread Processors for Low-Power Embedded Systems
- Won W. Ro, Jaeyoung Yi, Joon-Sang Park, and Joonseok Park
- IEICE Electronics Express, Vol. 5, No. 19, pp. 802-808, Oct. 2008
- Delay Analysis of Car-to-Car Reliable Data Delivery Strategies Based on Data Mulling with Network Coding
- Joon-Sang Park, Uichin Lee, Soon Young Oh, Mario Gerla, Desmond Siumen Lun, Won W. Ro, and Joonseok Park
- IEICE Transactions on Information and Systems, Vol. E91-D, No. 10, Oct. 2008
Conference Papers
- Parallel Algorithms for Steiner Tree Problem
- Joon-Sang Park, Won W. Ro, Handuck Lee, and Neungsoo Park
- In Proc. of the 3rd International Conference on Convergence and Hybrid Information Technology
- (ICHIT 2008)
- Busan, Korea, Nov. 11 - 13, 2008
2006
Journal Papers
- Design and Evaluation of a Hierarchical Decoupled Architecture
- Won W. Ro, Stephen P. Crago, Alvin M. Despain, and Jean-Luc Gaudiot
- Journal of Supercomputing, Springer, Vol. 38, No. 3, pp. 237-259, Dec. 2006
- Speculative Pre-Execution Assisted by Compiler (SPEAR)
- Won W. Ro and Jean-Luc Gaudiot
- Journal of Parallel and Distributed Computing, Elsevier, Vol. 66, No. 8, pp. 1076-1089, Aug. 2006
Conference Papers
- Design and Effectiveness of Small-Sized Decoupled Dispatch Queues
- Won W. Ro and Jean-Luc Gaudiot
- In Proc. of European Conference on Parallel Computing - LNCS
- (EURO-PAR 2006)
- Dresden, Germany, Aug. 29 - Sep. 1, 2006
2005
Conference Papers
- A Low-Complexity Issue Queue Design with Speculative Pre-Execution
- Won W. Ro and Jean-Luc Gaudiot
- In Proc. of the 12th International Conference on High Performance Computing
- (HiPC 2005)
- Goa, India, Dec. 18 - 21, 2005
Book Chapters
- Techniques to Improve Performance Beyond Pipelining: Superpipelining, Superscalar, and VLIW
- Jean-Luc Gaudiot, Jung-Yup Kang, and Won Woo Ro
- Chapter in "Computer Architecture", a volume of "Advances in Computers", edited by Ali R. Hurson, Elsevier, 2005
2004
Conference Papers
- SPEAR: A Hybrid Model for Speculative Pre-Execution
- Won W. Ro and Jean-Luc Gaudiot
- In Proc. of the 18th International Parallel and Distributed Processing Symposium
- (IPDPS 2004)
- Santa Fe, New Mexico, 2004
2003
Conference Papers
- HiDISC: A Decoupled Architecture for Data-Intensive Applications
- Won W. Ro, Jean-Luc Gaudiot, Stephen P. Crago, and Alvin M. Despain
- In Proc. of the 17th International Parallel and Distributed Processing Symposium
- (IPDPS 2003)
- Nice, France, Apr. 22 - 26, 2003
- Compiler Support for Dynamic Speculative Pre-Execution
- Won W. Ro and Jean-Luc Gaudiot
- In Proc. of the 7th Annual Workshop on Interaction between Compilers and Computer Architectures
- (INTERACT-7) in conjunction with HPCA-9
- Anaheim, California, Feb. 8, 2003
2000
Conference Papers
- Memory Latency: to Tolerate or to Reduce?
- Amol Bakshi, Jean-Luc Gaudiot, Wen-Yen Lin, Manil Makhija, Viktor K. Prasanna, Wonwoo Ro, and Chulho Shin
- In Proc. of the 12th Symposium on Computer Architecture and High Performance Computing
- (SBAC-PAD'00)
- Sao Pedro, Brazil, Oct. 24 - 27, 2000
- A High-Performance, Hierarchical Decoupled Architecture
- Stephen P. Crago, Alvin Despain, Jean-Luc Gaudiot, Manil Makhija, Wonwoo Ro, and Apoorv Srivastava
- In Proc. of the Memory access Decoupling for superscalar and multiple issue Architectures
- (MEDEA) Workshop in conjunction with PACT 2000
- Philadelphia, Oct. 15, 2000
- A Reliable Cluster Computing with a New Checkpointing RAID-x Architecture
- Kai Hwang, Hai Jin, Roy Ho, and Wonwoo Ro
- In Proc. of the 9th Heterogeneous Computing Workshop
- (HCW)
- Cancun, Mexico, May 1, 2000