Selected Publications

In Press

  • Dynamic Resizing on Active Warps Scheduler to Hide Operation Stalls on GPUs
  • Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
  • IEEE Transactions on Parallel and Distributed Systems, Accepted

2017

Journal Papers

  • Improving Energy Efficiency of GPUs through Data Compression and Compressed Execution
  • Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Murali Annavaram, and Won Woo Ro
  • IEEE Transactions on Computers, Vol. 66, No. 5, pp. 834-847, May 2017
  • Dynamic Load Balancing of Dispatch Scheduling for Solid State Disks
  • Myunghyun Jo and Won Woo Ro
  • IEEE Transactions on Computers, Vol. 66, No. 6, pp. 1034-1047, June 2017

Conference Papers

  • Access Pattern-Aware Cache Management for Improving Data Utilization in GPU
  • Gunjae Koo, Yunho Oh, Won Woo Ro, and Murali Annavaram
  • The 44rd ACM/IEEE International Symposium on Computer Architecture
  • (ISCA 2017)
  • Torronto, Canada, Jun. 24 - 28, 2017

2016

Journal Papers

  • Server Side, Play Buffer Based Quality Control for Adaptive Media Streaming
  • Keunsoo Kim, Benjamin Y. Cho, and Won Woo Ro
  • Multimedia Tools and Applications, Vol. 75, No. 10, pp. 5397-5415, May 2016
  • Exploiting Thread-Level Parallelism on HEVC by Employing Reference Dependency Graph
  • Minwoo Kim, Deokho Kim, Kyungah Kim, and Won Woo Ro
  • IEEE Transactions on Circuits and Systems for Video Technology, Vol. 26, No. 4, pp. 736-749, Apr. 2016
  • Parallel GPU Architecture Simulation Framework Exploiting Architectural-Level Parallelism with Timing Error Prediction
  • Sangpil Lee and Won Woo Ro
  • IEEE Transactions on Computers, Vol. 65, No. 4, pp. 1253-1265, Apr. 2016

Conference Papers

  • Virtual Thread: Maximizing Thread-Level Parallelism beyond GPU Scheduling Limit
  • Myung Kuk Yoon, Keunsoo Kim, Sangpil Lee, Won Woo Ro, and Murali Annavaram
  • The 43rd ACM/IEEE International Symposium on Computer Architecture
  • (ISCA 2016)
  • Seoul, Korea, Jun. 18 - 22, 2016
  • APRES: Improving Cache Efficiency by Exploiting Load Characteristics on GPUs
  • Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Won Woo Ro, and Murali Annavaram
  • The 43rd ACM/IEEE International Symposium on Computer Architecture
  • (ISCA 2016)
  • Seoul, Korea, Jun. 18 - 22, 2016
  • Warped-Slicer: Efficient Intra-SM Slicing through Dynamic Resource Partitioning for GPU Multiprogramming
  • Qiumin Xu, Hyeran Jeon, Keunsoo Kim, Won Woo Ro, and Murali Annavaram
  • The 43rd ACM/IEEE International Symposium on Computer Architecture
  • (ISCA 2016)
  • Seoul, Korea, Jun. 18 - 22, 2016
  • Warped-Preexecution: A GPU Pre-execution Approach for Improving Latency Hiding
  • Keunsoo Kim, Sangpil Lee, Myung Kuk Yoon, Gunjae Koo, Won Woo Ro, and Murali Annavaram
  • The 22nd IEEE Symposium on High Performance Computer Architecture
  • (HPCA 2016)
  • Barcelona, Spain, Mar. 12 - 16, 2016

2015

Journal Papers

  • A Performance-Energy Model to Evaluate Single Thread Execution Acceleration
  • Seung Hun Kim, Dohoon Kim, Changmin Lee, Won Seob Jeong, Won Woo Ro, and Jean-Luc Gaudiot
  • IEEE Computer Architecture Letters, Vol.14, No.2, pp. 99-102, Dec. 2015
  • Dynamic Load Balancing of Parallel SURF with Vertical Partitioning
  • Deokho Kim, Minwoo Kim, Kyungah Kim, Minyong Sung, and Won Woo Ro
  • IEEE Transactions on Parallel and Distributed Systems, Vol. 26, No. 12, pp. 3358-3370, Dec. 2015
  • Network Variation and Fault Tolerant Performance Acceleration in Mobile Devices with Simultaneous Remote Execution
  • Keunsoo Kim, Benjamin Y. Cho, Won Woo Ro, and Jean-Luc Gaudiot
  • IEEE Transactions on Computers, Vol. 64, No. 10, pp. 2862-2874, Oct. 2015
  • Highly Secure Mobile Devices Assisted with Trusted Cloud Computing Environments
  • Doohwan Oh, Ilkyu Kim, Keunsoo Kim, Sang-Min Lee, and Won Woo Ro
  • ETRI Journal, Vol. 37, No. 2, pp. 348-358, Apr. 2015

Conference Papers

  • True Motion Compensation With Feature Detection for Frame Rate Up-Conversion
  • Kyungah Kim, Minwoo Kim, Deokho Kim, and Won Woo Ro
  • The 2015 IEEE International Conference on Image Processing
  • (ICIP 2015)
  • Quebec City, Canada, Sep. 27 - 30, 2015
  • An Accelerated Separable Median Filter with Sorting Networks
  • Minsik Kim, Deokho Kim, Minyong Sung, and Won Woo Ro
  • The 2015 IEEE International Conference on Image Processing
  • (ICIP 2015)
  • Quebec City, Canada, Sep. 27 - 30, 2015
  • Enhancing Software Dependability and Security with Hardware Supported Instruction Address Space Randomization
  • Seung Hun Kim, Lei Xu, Ziyi Liu, Zhiqiang Lin, Won Woo Ro, and Weidong Shi
  • The 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks
  • (DSN 2015)
  • Rio de Janerio, Brazil, Jun. 22 - 25, 2015
  • Warped-Compression: Enabling Power Efficient GPUs through Register Compression
  • Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Won Woo Ro, and Murali Annavaram
  • The 42nd ACM/IEEE International Symposium on Computer Architecture
  • (ISCA 2015)
  • Portland, OR, USA, Jun. 13 - 17, 2015
  • DRAW: Investigating Benefits of Adaptive Fetch Group Size on GPU
  • Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
  • The 2015 IEEE International Symposium on Performance Analysis of Systems and Software
  • (ISPASS 2015)
  • Philadelphia, PA, USA, Mar. 29 - 31, 2015

2014

Journal Papers

  • A Malicious Pattern Detection Engine for Embedded Security Systems in Internet of Things
  • Doohwan Oh, Deokho Kim, and Won Woo Ro
  • Sensors, Vol. 14, No. 12, pp. 24188-24211, Dec. 2014
  • C-Lock: Energy Efficient Synchronization for Embedded Multicore Systems
  • Seung Hun Kim, Sang Hyong Lee, Minje Jun, Byunghoon Lee, Won Woo Ro, Eui-Young Chung,
    and Jean-Luc Gaudiot
  • IEEE Transactions on Computers, Vol. 63, No. 8, pp. 1962-1974, Aug. 2014
  • Swarm Processor System: Hardware Process Scheduler based Energy Efficient Multi-Core System
  • Won Seob Jeong, Seung Hun Kim, Sang-Min Lee, and Won Woo Ro
  • IEICE Electronics Express, Vol. 11, No. 14, pp. 20140424, July 2014
  • Complexity-Effective Contention Management with Dynamic Backoff for Transactional Memory Systems
  • Seung Hun Kim, Dongmin Choi, Won Woo Ro, and Jean-Luc Gaudiot
  • IEEE Transactions on Computers, Vol. 63, No. 7, pp. 1696-1708, July 2014
  • Architectural Investigation of Matrix Data Layout on Multicore Processors
  • Minwoo Kim and Won Woo Ro
  • Future Generation Computer Systems, Vol. 37, pp. 64-75, July 2014
  • Exploiting Implementation Diversity and Partial Connection of Routers in Application-Specific Network-on-Chip Topology Synthesis
  • Minje Jun, Won Woo Ro, and Eui-Young Chung
  • IEEE Transactions on Computers, Vol. 63, No. 6, pp. 1434-1445, Jun. 2014
  • Accelerating MapReduce Framework on Multi-GPU Systems
  • Hai Jiang, Yi Chen, Zhi Qiao, Kuan-Ching Li, Won Woo Ro, and Jean-Luc Gaudiot
  • Cluster Computing, Vol. 17, No. 2, pp. 293-301, Jun. 2014
  • Boosting CUDA Applications with CPU-GPU Hybrid Computing
  • Changmin Lee, Won Woo Ro, and Jean-Luc Gaudiot
  • International Journal of Parallel Programming, Vol. 42, No. 2, pp. 384-404, Apr. 2014
  • This is an extension of our INTERACT-16 paper which has been selected as one of the best papers and recommended to IJPP.

Conference Papers

  • LUT based Secure Cloud Computing ‐ an Implementation using FPGAs
  • Lei Xu, Pham Dang Khoa, Seung Hun Kim, Won Woo Ro, and Weidong Shi
  • 2014 International Conference on ReConFigurable Computing and FPGAs
  • (ReConFig 2014)
  • Cancun, Mexico, Dec. 7 - 10, 2014
  • Workload Synthesis: Generating Benchmark Workloads from Statistical Execution Profile
  • Keunsoo Kim, Changmin Lee, Jung Ho Jung, and Won Woo Ro
  • IEEE International Symposium on Workload Characterization
  • (IISWC 2014)
  • Raleigh, North Carolina, USA, Oct. 26 - 28, 2014

2013

Journal Papers

  • Parallelized Sub-Resource Loading for Web Rendering Engine
  • Deokho Kim, Changmin Lee, Sangpil Lee, and Won Woo Ro
  • Journal of Systems Architecture, Vol. 59, No. 9, pp. 785-793, Oct. 2013
  • Design and Evaluation of Random Linear Network Coding Accelerators on FPGAs
  • Sunwoo Kim, Won Seob Jeong, Won Woo Ro, and Jean-Luc Gaudiot
  • ACM Transactions on Embedded Computing Systems, Vol.13, No. 1, pp. 1-24, Aug. 2013
  • GPU-Friendly Parallel Genome Matching with Tiled Access and Reduced State Transition Table
  • Yunho Oh, Doohwan Oh, and Won Woo Ro
  • International Journal of Parallel Programming, Vol. 41, No. 4, pp. 526-551, Aug. 2013
  • A Distributed Signature Detection Method for Detecting Intrusions in Sensor Systems
  • Ilkyu Kim, Doohwan Oh, Myung Kuk Yoon, Kyueun Yi, and Won Woo Ro
  • Sensors, Vol. 13, No. 4, pp. 3998-4016, Mar. 2013
  • Exploiting SIMD Parallelism on Dynamically Partitioned Parallel Network Coding for P2P Systems
  • Deokho Kim, Karam Park, and Won Woo Ro
  • Computers & Electrical Engineering, Vol. 39, No. 1, pp. 55-56, Jan. 2013
  • Benefits of Using Parallelized Non-Progressive Network Coding
  • Minwoo Kim, Karam Park, and Won Woo Ro
  • Journal of Network and Computer Applications, Vol. 36, No. 1, pp. 293-305, Jan. 2013
  • Importance of Coherence Protocols with Network Applications on Multi-Core Processors
  • Kyueun Yi, Won Woo Ro, and Jean-Luc Gaudiot
  • IEEE Transactions on Computers, Vol. 62, No. 1, pp. 6-15, Jan. 2013

Conference Papers

  • XSD: Accelerating MapReduce by Harnessing the GPU inside an SSD
  • Benjamin Y. Cho, Won Seob Jeong, Doohwan Oh, and Won Woo Ro
  • The 1st Workshop on Near-Data Processing. In conjunction with the MICRO-46
  • (WoNDP 2013)
  • Davis, USA, Dec. 8, 2013
  • Mark-Sharing: A Parallel Garbage Collection Algorithm for Low Synchronization Overhead
  • Hyunkyu Park, Changmin Lee, Seung Hun Kim, Won Woo Ro and Jean-Luc Gaudiot
  • The 19th IEEE International Conference on Parallel and Distributed Systems
  • (ICPADS 2013)
  • Seoul, Korea, Dec. 15 - 18, 2013
  • MGMR: Multi-GPU Based MapReduce
  • Yi Chen, Zhi Qiao, Hai Jiang, Kuan-Ching Li, Won Woo Ro
  • The 8th International Conference on Grid and Pervasive Computing
  • (GPC 2013)
  • Seoul, Korea, May. 9 - 11, 2013
  • Parallel GPU Architecture Simulation Framework Exploiting Work Allocation Unit Parallelism
  • Sangpil Lee and Won Woo Ro
  • The 2013 IEEE International Symposium on Performance Analysis of Systems and Software
  • (ISPASS 2013)
  • Austin, TX, USA, Apr. 21 - 23, 2013

2012

Journal Papers

  • Multi-Threading and Suffix Grouping on Massive Multiple Pattern Matching Algorithm
  • Doohwan Oh and Won Woo Ro
  • The Computer Journal, Vol. 55, No. 11, pp. 1331-1346, Nov. 2012
  • Offloading of Media Transcoding for High-Quality Multimedia Services
  • Seung Hun Kim, Keunsoo Kim, Changmin Lee, and Won Woo Ro
  • IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, pp. 691-699, May 2012
  • Design of a Power-Efficient Parallel Pipelined Bloom Filter
  • Deokho Kim, Doohwan Oh, and Won Woo Ro
  • Electronics Letters, Vol. 48, No. 7, pp. 367-369, Mar. 2012
  • Reconfigurable and Parallelized Network Coding Decoder for VANETs
  • Sunwoo Kim and Won Woo Ro
  • Mobile Information Systems, Vol. 8, No. 1, pp. 45-59, Feb. 2012
  • Accelerated Network Coding with Dynamic Stream Decomposition on Graphics Processing Unit
  • Sangpil Lee and Won Woo Ro
  • The Computer Journal, Vol. 55, No. 1, pp. 21-34, Jan. 2012

Conference Papers

  • Conflict Avoidance Scheduling using Grouping List for Transactional Memory
  • Dongmin Choi, Seung Hun Kim, and Won Woo Ro
  • The 17th International Workshop on High-Level Parallel Programming Models and Supportive Environments
  • (HIPS-17)
  • Shanghai, China, May 21, 2012
  • Cooperative Heterogeneous Computing for Parallel Processing on CPU/GPU Hybrids
  • Changmin Lee, Won Woo Ro, and Jean-Luc Gaudiot
  • The 16th Workshop on Interaction between Compilers and Computer Architectures
  • (INTERACT-16)
  • New Orleans, USA, Feb. 25 - 29, 2012

2011

Journal Papers

  • A Novel Sequential Tree Algorithm Based on Scoreboard for MPI Broadcast Communication
  • Won-young Chung, Jae-won Park, Seung-Woo Lee, Won Woo Ro, and Yong-surk Lee
  • IEICE Transactions on Information and Systems, Vol 94, No. 12, pp. 2523-2527, December. 2011
  • Network Coding on Heterogeneous Multi-Core Processors for Wireless Sensor Networks
  • Deokho Kim, Karam Park, and Won W. Ro
  • Sensors, Vol 11, No. 8, pp. 7908-7933, Aug. 2011
  • A Low-Cost Standard Mode MPI Hardware Unit for Embedded MPSoC
  • Won-Young Chung, Ha-Young Jeong, Won W. Ro, and Yong-Surk Lee
  • IEICE Transactions on Information and Systems, Vol. E94-D, No.7, pp. 1497-1501, July 2011

Conference Papers

  • Parallel Transpose of Matrix Multiplication Based on the Tiling Algorithm
  • Minwoo Kim, Yong J. Jang, and Won W. Ro
  • The 54th IEEE International Midwest Symposium on Circuits and Systems
  • (MWSCAS 2011)
  • Seoul, Korea, Aug. 7 - 10, 2011
  • Performance Evaluation of Adaptive Progressive Network Coding
  • Deokho Kim, Karam Park, and Won W. Ro
  • The 54th IEEE International Midwest Symposium on Circuits and Systems
  • (MWSCAS 2011)
  • Seoul, Korea, Aug. 7 - 10, 2011

2010

Journal Papers

  • Multithreaded Pattern Matching Algorithm with Data Rearrangement
  • Doohwan Oh, Seung Hun Kim, and Won W. Ro
  • IEICE Electronics Express, Vol. 7, No. 20, pp. 1520-1526, Oct. 2010
  • On Improving Parallelized Network Coding with Dynamic Partitioning
  • Karam Park, Joon-Sang Park, and Won W. Ro
  • IEEE Transactions on Parallel and Distributed Systems, Vol. 21, No. 11, pp. 1547-1560, Nov. 2010
  • Hardware Implementation of a Tessellation Accelerator for the OpenVG Standard
  • Seung Hun Kim, Yunho Oh, Karam Park, and Won W. Ro
  • IEICE Electronics Express, Vol. 7, No. 6, pp. 440-446, Mar. 2010

Conference Papers

  • Development of Virtual CUDA Systems of Parallel Processing on CPU and GPGPU
  • Doohwan Oh, Sangpil Lee, Deokho Kim, Changmin Lee, and Won W. Ro
  • Workshop on Micro Architectural Support for Virtualization, Data Center Computing, and Clouds In Conjunction with MICRO 2010
  • (MASVDC Workshop 2010)
  • Atlanta, USA, Dec. 5, 2010
  • Implementing FFT using SPMD style of OpenMP
  • Tien-Hsiung Weng, Sheng-Wei Huang, Won Woo Ro, and Kuan-Ching Li
  • In Proc. of the 6th International Conference on Networked Computing and Advanced Information Management
  • (NCM 2010)
  • Seoul, Korea, Aug. 16 - 18, 2010
  • Accelerated Reconstruction Using Parallel Computing for Spiral Spectroscopic Imaging
  • Dong H. Kim, Yun H. Oh, Yun H. Nam, M. Gu, and Won W. Ro
  • In Proc. of 2010 International Society for Magnetic Resonance in Medicine Annual Meeting
  • (2010 ISMRM Annual Meeting)
  • Stockholm, Sweden, May 1 - 7, 2010
  • FPGA Implementation of Highly Parallelized Decoder Logic for Network Coding
  • Sunwoo Kim and Won W. Ro
  • In Proc. of Eighteenth ACM/SIGDA International Symposium on Field-Programmable Gate Arrays
  • (FPGA 2010)
  • Monterey, USA, Feb. 21 - 23, 2010

2009

Journal Papers

  • A Complexity-Effective Microprocessor Design with Decoupled Dispatch Queues and Prefetching
  • Won W. Ro and Jean-Luc Gaudiot
  • Parallel Computing, Vol. 35, No. 5, pp. 255-268, May 2009

Conference Papers

  • Evaluation of Cache Coherence Protocols on Multi-Core Systems with Linear Workloads
  • Yong J. Jang and Won W. Ro
  • In Proc. of 2009 International Colloquium on Computing, Communication, Control, and Management
  • (CCCM 2009)
  • Sanya, China, Aug. 8 - 9, 2009
  • Comparing Open Source Web Services: gSoap and AXIS
  • Jongwook Woo and Won W. Ro
  • In Proc. of the 24th International Technical Conference on Circuits/Systems, Computers and Communications
  • (ITC-CSCC 2009)
  • Jeju Island, Korea, July 5 - 8, 2009
  • Efficient Parallelized Network Coding for P2P File Sharing Applications
  • Karam Park, Joon-Sang Park, and Won W. Ro
  • In Proc. of the 4th International Conference on Grid and Pervasive Computing
  • (GPC 2009)
  • Geneva, Switcherland, May 4 - 8, 2009
  • Fully Pipelined Hardware Implementation of 128-bit SEED Block Cipher Algorithm
  • Jaeyoung Yi, Karam Park, Joonseok Park, and Won W. Ro
  • In Proc. of the 5th International Workshop on Applied Reconfigurable Computing
  • (ARC 2009)
  • Karlsruhe, Germany, Mar. 16 - 18, 2009

Book Chapters

  • Programmability and Scalability on Multi-Core Architectures
  • Jaeyoung Yi, Yong J. Jang, Doohwan Oh, and Won W. Ro
  • Chapter in "Handbook of Research on Scalable Computing Technologies", edited by Kuan-Ching Li, Ching-Hsien Hsu, Laurence Tianruo Yang, Jack Dongarra, and Hans Zima, Information Science Reference, 2009

2008

Journal Papers

  • Efficient Peer-to-Peer File Sharing Using Network Coding in MANET
  • Uichin Lee, Joon-Sang Park, Seung-Hoon Lee, Won W. Ro, Giovanni Pau, and Mario Gerla
  • Journal of Communications and Networks, Vol. 10, No. 4, Dec. 2008
  • A Low-Complexity Microprocessor Design with Speculative Pre-Execution
  • Won W. Ro and Jean-Luc Gaudiot
  • Journal of Systems Architecture, Vol. 54, No. 12, pp. 1101-1112, Dec. 2008
  • Performance Evaluation of Programming Models for SMP-Based Clusters
  • Myungho Lee, Neungsoo Park, Won W. Ro, and Kuan-Ching Li
  • Journal of the Chinese Institute of Engineers, Vol. 31, No. 7, pp. 1181-1188, Dec. 2008
  • Simultaneous Thin-Thread Processors for Low-Power Embedded Systems
  • Won W. Ro, Jaeyoung Yi, Joon-Sang Park, and Joonseok Park
  • IEICE Electronics Express, Vol. 5, No. 19, pp. 802-808, Oct. 2008
  • Delay Analysis of Car-to-Car Reliable Data Delivery Strategies Based on Data Mulling with Network Coding
  • Joon-Sang Park, Uichin Lee, Soon Young Oh, Mario Gerla, Desmond Siumen Lun, Won W. Ro, and Joonseok Park
  • IEICE Transactions on Information and Systems, Vol. E91-D, No. 10, Oct. 2008

Conference Papers

  • Parallel Algorithms for Steiner Tree Problem
  • Joon-Sang Park, Won W. Ro, Handuck Lee, and Neungsoo Park
  • In Proc. of the 3rd International Conference on Convergence and Hybrid Information Technology
  • (ICHIT 2008)
  • Busan, Korea, Nov. 11 - 13, 2008

2006

Journal Papers

  • Design and Evaluation of a Hierarchical Decoupled Architecture
  • Won W. Ro, Stephen P. Crago, Alvin M. Despain, and Jean-Luc Gaudiot
  • Journal of Supercomputing, Springer, Vol. 38, No. 3, pp. 237-259, Dec. 2006
  • Speculative Pre-Execution Assisted by Compiler (SPEAR)
  • Won W. Ro and Jean-Luc Gaudiot
  • Journal of Parallel and Distributed Computing, Elsevier, Vol. 66, No. 8, pp. 1076-1089, Aug. 2006

Conference Papers

  • Design and Effectiveness of Small-Sized Decoupled Dispatch Queues
  • Won W. Ro and Jean-Luc Gaudiot
  • In Proc. of European Conference on Parallel Computing - LNCS
  • (EURO-PAR 2006)
  • Dresden, Germany, Aug. 29 - Sep. 1, 2006

2005

Conference Papers

  • A Low-Complexity Issue Queue Design with Speculative Pre-Execution
  • Won W. Ro and Jean-Luc Gaudiot
  • In Proc. of the 12th International Conference on High Performance Computing
  • (HiPC 2005)
  • Goa, India, Dec. 18 - 21, 2005

Book Chapters

  • Techniques to Improve Performance Beyond Pipelining: Superpipelining, Superscalar, and VLIW
  • Jean-Luc Gaudiot, Jung-Yup Kang, and Won Woo Ro
  • Chapter in "Computer Architecture", a volume of "Advance in Computers", edited by Ali R.Hurson, Elsevier, 2005

2004

Conference Papers

  • SPEAR: A Hybrid Model for Speculative Pre-Execution
  • Won W. Ro and Jean-Luc Gaudiot
  • In Proc. of the 18th International Parallel and Distributed Processing Symposium
  • (IPDPS 2004)
  • Santa Fe, New Mexico, 2004

2003

Conference Papers

  • HiDISC: A Decoupled Architecture for Data-Intensive Applications
  • Won W. Ro, Jean-Luc Gaudiot, Stephen P. Crago, and Alvin M. Despain
  • In Proc. of the 17th International Parallel and Distributed Processing Symposium
  • (IPDPS 2003)
  • Nice, France, Apr. 22 - 26, 2003
  • Compiler Support for Dynamic Speculative Pre-Execution
  • Won W. Ro and Jean-Luc Gaudiot
  • In Proc. of the 7th Annual Workshop on Interaction between Compilers and Computer Architectures
  • (INTERACT-7) in conjunction with HPCA-9
  • Anaheim, California, Feb. 8, 2003

2000

Conference Papers

  • Memory Latency: to Tolerate or to Reduce?
  • Amol Bakshi, Jean-Luc Gaudiot, Wen-Yen Lin, Manil Makhija, Viktor K. Prasanna, Wonwoo Ro, and Chulho Shin
  • In Proc. of the 12th Symposium on Computer Architecture and High Performance Computing
  • (SBAC-PAD'00)
  • Sao Pedro, Brazil, Oct. 24 - 27, 2000
  • A High-Performance, Hierarchical Decoupled Architecture
  • Stephen P. Crago, Alvin Despain, Jean-Luc Gaudiot, Manil Makhija, Wonwoo Ro, and Apoorv Srivastava
  • In Proc. of the Memory access Decoupling for superscalar and multiple issue Architectures
  • (MEDEA) Workshop in conjunction with PACT 2000
  • Philadelphia, Oct. 15, 2000
  • A Reliable Cluster Computing with a New Checkpointing RAID-x Architecture
  • Kai Hwang, Hai Jin, Roy Ho, and Wonwoo Ro
  • In Proc. of the 9th Heterogeneous Computing Workshop
  • (HCW)
  • Cancun, Mexico, May 1, 2000

All Publications

In Press

  • Dynamic Resizing on Active Warps Scheduler to Hide Operation Stalls on GPUs
  • Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
  • IEEE Transactions on Parallel and Distributed Systems, Accepted

2017

Journal Papers

  • Improving Energy Efficiency of GPUs through Data Compression and Compressed Execution
  • Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Murali Annavaram, and Won Woo Ro
  • IEEE Transactions on Computers, Vol. 66, No. 5, pp. 834-847, May 2017
  • Dynamic Load Balancing of Dispatch Scheduling for Solid State Disks
  • Myunghyun Jo and Won Woo Ro
  • IEEE Transactions on Computers, Vol. 66, No. 6, pp. 1034-1047, June 2017

Conference Papers

  • Access Pattern-Aware Cache Management for Improving Data Utilization in GPU
  • Gunjae Koo, Yunho Oh, Won Woo Ro, and Murali Annavaram
  • The 44rd ACM/IEEE International Symposium on Computer Architecture
  • (ISCA 2017)
  • Torronto, Canada, Jun. 24 - 28, 2017
  • Dynamic Warp Scheduler Selection Policy Using Linear Regression for GPUs
  • Hyunjune Shin, Kyungmin Lee, I Poom Jeong, Jong Hyun Park, and Won Woo Ro
  • The 16th International Conference on Electronics, Information and Communication
  • (ICEIC 2017)
  • Phuket, Thailand, Jan. 11 - 14, 2017
  • Exploiting L2 Cache Sensitivity in Artificial Neural Network on GPUs
  • Seihoon Park, Yoonsoo Kim, Minsik Kim, and Won Woo Ro
  • The 16th International Conference on Electronics, Information and Communication
  • (ICEIC 2017)
  • Phuket, Thailand, Jan. 11 - 14, 2017
  • Optimizing Intersection and Reflection Step of Geometrical Optics using GPUs
  • Hyun Jin Chung, Myung Kuk Yoon, and Won Woo Ro
  • The 16th International Conference on Electronics, Information and Communication
  • (ICEIC 2017)
  • Phuket, Thailand, Jan. 11 - 14, 2017
  • Analysis of Error Tolerance in Convolution Neural Networks
  • Sangheon Kwon, Jong Hyun Park, and Won Woo Ro
  • The 16th International Conference on Electronics, Information and Communication
  • (ICEIC 2017)
  • Phuket, Thailand, Jan. 11 - 14, 2017

2016

Journal Papers

  • Server Side, Play Buffer Based Quality Control for Adaptive Media Streaming
  • Keunsoo Kim, Benjamin Y. Cho, and Won Woo Ro
  • Multimedia Tools and Applications, Vol. 75, No. 10, pp. 5397-5415, May 2016
  • Exploiting Thread-Level Parallelism on HEVC by Employing Reference Dependency Graph
  • Minwoo Kim, Deokho Kim, Kyungah Kim, and Won Woo Ro
  • IEEE Transactions on Circuits and Systems for Video Technology, Vol. 26, No. 4, pp. 736-749, Apr. 2016
  • Parallel GPU Architecture Simulation Framework Exploiting Architectural-Level Parallelism with Timing Error Prediction
  • Sangpil Lee and Won Woo Ro
  • IEEE Transactions on Computers, Vol. 65, No. 4, pp. 1253-1265, Apr. 2016

Conference Papers

  • Measuring Error-Tolerance in SRAM Architecture on Hardware Accelerated Neural Network
  • Sangheon Kwon, Kyungmin Lee, Yoonsoo Kim, Kyungah Kim, Changmin Lee, and Won Woo Ro
  • The 1st IEEE International Conference on Consumer Electronics Asia
  • (ICCE-ASIA 2016)
  • Seoul, Korea, Oct. 26 - 28, 2016
  • Virtual Thread: Maximizing Thread-Level Parallelism beyond GPU Scheduling Limit
  • Myung Kuk Yoon, Keunsoo Kim, Sangpil Lee, Won Woo Ro, and Murali Annavaram
  • The 43rd ACM/IEEE International Symposium on Computer Architecture
  • (ISCA 2016)
  • Seoul, Korea, Jun. 18 - 22, 2016
  • APRES: Improving Cache Efficiency by Exploiting Load Characteristics on GPUs
  • Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Won Woo Ro, and Murali Annavaram
  • The 43rd ACM/IEEE International Symposium on Computer Architecture
  • (ISCA 2016)
  • Seoul, Korea, Jun. 18 - 22, 2016
  • Warped-Slicer: Efficient Intra-SM Slicing through Dynamic Resource Partitioning for GPU Multiprogramming
  • Qiumin Xu, Hyeran Jeon, Keunsoo Kim, Won Woo Ro, and Murali Annavaram
  • The 43rd ACM/IEEE International Symposium on Computer Architecture
  • (ISCA 2016)
  • Seoul, Korea, Jun. 18 - 22, 2016
  • Warped-Preexecution: A GPU Pre-execution Approach for Improving Latency Hiding
  • Keunsoo Kim, Sangpil Lee, Myung Kuk Yoon, Gunjae Koo, Won Woo Ro, and Murali Annavaram
  • The 22nd IEEE Symposium on High Performance Computer Architecture
  • (HPCA 2016)
  • Barcelona, Spain, Mar. 12 - 16, 2016
  • Accelerating Forwading Computation of ANN using CUDA
  • Jong Hyun Park and Won Woo Ro
  • The 15th International Conference on Electronics, Information and Communication
  • (ICEIC 2016)
  • Danang, Vietnam, Jan. 27 - 30, 2016
  • Fairness-Aware Thread Scheduling for Multithreaded Program using Intel Software Guarded Extensions
  • Won Jeon, Seung Hun Kim, and Won Woo Ro
  • The 15th International Conference on Electronics, Information and Communication
  • (ICEIC 2016)
  • Danang, Vietnam, Jan. 27 - 30, 2016

2015

Journal Papers

  • A Performance-Energy Model to Evaluate Single Thread Execution Acceleration
  • Seung Hun Kim, Dohoon Kim, Changmin Lee, Won Seob Jeong, Won Woo Ro, and Jean-Luc Gaudiot
  • IEEE Computer Architecture Letters, Vol.14, No.2, pp. 99-102, Dec. 2015
  • Dynamic Load Balancing of Parallel SURF with Vertical Partitioning
  • Deokho Kim, Minwoo Kim, Kyungah Kim, Minyong Sung, and Won Woo Ro
  • IEEE Transactions on Parallel and Distributed Systems, Vol. 26, No. 12, pp. 3358-3370, Dec. 2015
  • Network Variation and Fault Tolerant Performance Acceleration in Mobile Devices with Simultaneous Remote Execution
  • Keunsoo Kim, Benjamin Y. Cho, Won Woo Ro, and Jean-Luc Gaudiot
  • IEEE Transactions on Computers, Vol. 64, No. 10, pp. 2862-2874, Oct. 2015
  • Highly Secure Mobile Devices Assisted with Trusted Cloud Computing Environments
  • Doohwan Oh, Ilkyu Kim, Keunsoo Kim, Sang-Min Lee, and Won Woo Ro
  • ETRI Journal, Vol. 37, No. 2, pp. 348-358, Apr. 2015

Conference Papers

  • True Motion Compensation With Feature Detection for Frame Rate Up-Conversion
  • Kyungah Kim, Minwoo Kim, Deokho Kim, and Won Woo Ro
  • The 2015 IEEE International Conference on Image Processing
  • (ICIP 2015)
  • Quebec City, Canada, Sep. 27 - 30, 2015
  • An Accelerated Separable Median Filter with Sorting Networks
  • Minsik Kim, Deokho Kim, Minyong Sung, and Won Woo Ro
  • The 2015 IEEE International Conference on Image Processing
  • (ICIP 2015)
  • Quebec City, Canada, Sep. 27 - 30, 2015
  • Contention-Free Fair Queuing for High-Speed Storage with RAID-0 Architecture
  • Myung Hyun Jo and Won Woo Ro
  • The 17TH IEEE International Conference on High Performance Computing and Communications
  • (HPCC 2015)
  • New York, USA, Aug. 24 - 26, 2015
  • Integrity Protection for Big Data Processing with Dynamic Redundancy Computation
  • Zhimin Gao, Nicholas DeSalvo, Pham Dang Khoa, Seung Hun Kim, Lei Xu, Won Woo Ro, Rakesh M. Verma,
    and Weidong Shi
  • The 2015 IEEE International Conference on Autonomic Computing
  • (ICAC 2015)
  • Grenoble, France, July 7 - 10, 2015
  • Improving Pipeline Utilization with Two-Level Instruction Issue on GPUs
  • Yunho Oh, Jong Hyun Park, and Won Woo Ro
  • The 30th International Techinical Conference on Circuits/Systems, Computers and Communicaions
  • (ITC-CSCC 2015)
  • Seoul, Korea, Jun. 29 - July 2, 2015
  • Accelerating ELMs on the GPU Toward Real-Time Training on Large Scale Data Sets
  • Han Kyul Kim, Jong Hyun Park, and Won Woo Ro
  • The 30th International Technical Conference on Circuits/Systems, Computers and Communications
  • (ITC-CSCC 2015)
  • Seoul, Korea, Jun. 29 - July 2, 2015
  • A Frequency Scaling Model for Energy Efficient DVFS Designs based on Circuit Delay Optimization
  • Ki Bum Chun, Changmin Lee and Won Woo Ro
  • The 19th IEEE International Symposium on Consumer Electronics
  • (ISCE 2015)
  • UPM, Madrid, Spain, Jun. 24 - 26, 2015
  • Another Look at Secure Big Data Processing: a Formal Framework and a Practical Approach
  • Lei Xu, Seung Hun Kim, Won Woo Ro, and Weidong Shi
  • The 8th IEEE International Conference on Cloud Computing
  • (Cloud'15, Application Track)
  • New York, USA, Jun. 27 - July 2, 2015
  • Enhancing Software Dependability and Security with Hardware Supported Instruction Address Space Randomization
  • Seung Hun Kim, Lei Xu, Ziyi Liu, Zhiqiang Lin, Won Woo Ro, and Weidong Shi
  • The 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks
  • (DSN 2015)
  • Rio de Janerio, Brazil, Jun. 22 - 25, 2015
  • Warped-Compression: Enabling Power Efficient GPUs through Register Compression
  • Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Won Woo Ro, and Murali Annavaram
  • The 42nd ACM/IEEE International Symposium on Computer Architecture
  • (ISCA 2015)
  • Portland, OR, USA, Jun. 13 - 17, 2015
  • DRAW: Investigating Benefits of Adaptive Fetch Group Size on GPU
  • Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
  • The 2015 IEEE International Symposium on Performance Analysis of Systems and Software
  • (ISPASS 2015)
  • Philadelphia, PA, USA, Mar. 29 - 31, 2015

2014

Journal Papers

  • A Malicious Pattern Detection Engine for Embedded Security Systems in Internet of Things
  • Doohwan Oh, Deokho Kim, and Won Woo Ro
  • Sensors, Vol. 14, No. 12, pp. 24188-24211, Dec. 2014
  • C-Lock: Energy Efficient Synchronization for Embedded Multicore Systems
  • Seung Hun Kim, Sang Hyong Lee, Minje Jun, Byunghoon Lee, Won Woo Ro, Eui-Young Chung,
    and Jean-Luc Gaudiot
  • IEEE Transactions on Computers, Vol. 63, No. 8, pp. 1962-1974, Aug. 2014
  • Swarm Processor System: Hardware Process Scheduler based Energy Efficient Multi-Core System
  • Won Seob Jeong, Seung Hun Kim, Sang-Min Lee, and Won Woo Ro
  • IEICE Electronics Express, Vol. 11, No. 14, pp. 20140424, July 2014
  • Complexity-Effective Contention Management with Dynamic Backoff for Transactional Memory Systems
  • Seung Hun Kim, Dongmin Choi, Won Woo Ro, and Jean-Luc Gaudiot
  • IEEE Transactions on Computers, Vol. 63, No. 7, pp. 1696-1708, July 2014
  • Architectural Investigation of Matrix Data Layout on Multicore Processors
  • Minwoo Kim and Won Woo Ro
  • Future Generation Computer Systems, Vol. 37, pp. 64-75, July 2014
  • Exploiting Implementation Diversity and Partial Connection of Routers in Application-Specific Network-on-Chip Topology Synthesis
  • Minje Jun, Won Woo Ro, and Eui-Young Chung
  • IEEE Transactions on Computers, Vol. 63, No. 6, pp. 1434-1445, Jun. 2014
  • Accelerating MapReduce Framework on Multi-GPU Systems
  • Hai Jiang, Yi Chen, Zhi Qiao, Kuan-Ching Li, Won Woo Ro, and Jean-Luc Gaudiot
  • Cluster Computing, Vol. 17, No. 2, pp. 293-301, Jun. 2014
  • Boosting CUDA Applications with CPU-GPU Hybrid Computing
  • Changmin Lee, Won Woo Ro, and Jean-Luc Gaudiot
  • International Journal of Parallel Programming, Vol. 42, No. 2, pp. 384-404, Apr. 2014
  • This is an extension of our INTERACT-16 paper which has been selected as one of the best papers and recommended to IJPP.

Conference Papers

  • LUT based Secure Cloud Computing ‐ an Implementation using FPGAs
  • Lei Xu, Pham Dang Khoa, Seung Hun Kim, Won Woo Ro, and Weidong Shi
  • 2014 International Conference on ReConFigurable Computing and FPGAs
  • (ReConFig 2014)
  • Cancun, Mexico, Dec. 7 - 10, 2014
  • Workload Synthesis: Generating Benchmark Workloads from Statistical Execution Profile
  • Keunsoo Kim, Changmin Lee, Jung Ho Jung, and Won Woo Ro
  • IEEE International Symposium on Workload Characterization
  • (IISWC 2014)
  • Raleigh, North Carolina, USA, Oct. 26 - 28, 2014
  • Accelerating Gesture Recognition Algorithm Using Coarse Grained Reconfigurable Architectures
  • Minsik Kim, Deokho Kim, Minyong Sung, Wonjae Lee, Jaehyun Kim, and Won Woo Ro
  • The 4th International Conference on Audio, Language and Image Processing
  • (ICALIP 2014)
  • Shanghai, China, July 7 - 9, 2014
  • A Micro-benchmark Suite to Understand Micro-Architectural Differences between Processors
  • Changmin Lee, Keunsoo Kim, Jung Ho Jung, and Won Woo Ro
  • The 29th International Technical Conference on Circuits/Systems, Computers and Communications
  • (ITC-CSCC 2014)
  • Phuket, Thailand, July 1 - 4, 2014
  • Maximizing DRAM Performance using Selective Operating Frequency Boosting
  • Jung Ho Jung, Seung Hun Kim, Changmin Lee, and Won Woo Ro
  • The 18th International Symposium on Consumer Electronics
  • (ISCE 2014)
  • Jeju, Korea, Jun. 22 - 25, 2014
  • Workload and Variation Aware Thread Scheduling for Heterogeneous Multi-processor
  • Seungwon Lee and Won Woo Ro
  • The 18th International Symposium on Consumer Electronics
  • (ISCE 2014)
  • Jeju, Korea, Jun. 22 - 25, 2014
  • Best paper award, Bronze prize
  • DPM: Data Partitioning Method for Pipelined MapReduce on GPU
  • Myung Hyun Jo and Won Woo Ro
  • The 18th International Symposium on Consumer Electronics
  • (ISCE 2014)
  • Jeju, Korea, Jun. 22 - 25, 2014
  • Accelerating HEVC Transcoder by Exploiting Decoded Quadtree
  • Minyong Sung, Minwoo Kim, Minsik Kim, and Won Woo Ro
  • The 18th International Symposium on Consumer Electronics
  • (ISCE 2014)
  • Jeju, Korea, Jun. 22 - 25, 2014
  • Multicore Speedup Models using Frequency Scaling with Fixed Power Budget
  • Seungwon Lee, Seung Hun Kim, and Won Woo Ro
  • The 13th International Conference on Electronics, Information and Communication
  • (ICEIC 2014)
  • Kota Kinabalu, Malaysia, Jan. 15 - 18, 2014
  • Hyper Threading-aware Virtual Machine Migration
  • Chungmu Oh, and Won Woo Ro
  • The 13th International Conference on Electronics, Information and Communication
  • (ICEIC 2014)
  • Kota Kinabalu, Malaysia, Jan. 15 - 18, 2014
  • Development of Efficient VCPU Pinning Mechanism in Xen
  • Kyung Yoon Min, Seung Hun Kim, and Won Woo Ro
  • The 13th International Conference on Electronics, Information and Communication
  • (ICEIC 2014)
  • Kota Kinabalu, Malaysia, Jan. 15 - 18, 2014

2013

Journal Papers

  • Parallelized Sub-Resource Loading for Web Rendering Engine
  • Deokho Kim, Changmin Lee, Sangpil Lee, and Won Woo Ro
  • Journal of Systems Architecture, Vol. 59, No. 9, pp. 785-793, Oct. 2013
  • Design and Evaluation of Random Linear Network Coding Accelerators on FPGAs
  • Sunwoo Kim, Won Seob Jeong, Won Woo Ro, and Jean-Luc Gaudiot
  • ACM Transactions on Embedded Computing Systems, Vol.13, No. 1, pp. 1-24, Aug. 2013
  • GPU-Friendly Parallel Genome Matching with Tiled Access and Reduced State Transition Table
  • Yunho Oh, Doohwan Oh, and Won Woo Ro
  • International Journal of Parallel Programming, Vol. 41, No. 4, pp. 526-551, Aug. 2013
  • A Distributed Signature Detection Method for Detecting Intrusions in Sensor Systems
  • Ilkyu Kim, Doohwan Oh, Myung Kuk Yoon, Kyueun Yi, and Won Woo Ro
  • Sensors, Vol. 13, No. 4, pp. 3998-4016, Mar. 2013
  • Exploiting SIMD Parallelism on Dynamically Partitioned Parallel Network Coding for P2P Systems
  • Deokho Kim, Karam Park, and Won Woo Ro
  • Computers & Electrical Engineering, Vol. 39, No. 1, pp. 55-56, Jan. 2013
  • Benefits of Using Parallelized Non-Progressive Network Coding
  • Minwoo Kim, Karam Park, and Won Woo Ro
  • Journal of Network and Computer Applications, Vol. 36, No. 1, pp. 293-305, Jan. 2013
  • Importance of Coherence Protocols with Network Applications on Multi-Core Processors
  • Kyueun Yi, Won Woo Ro, and Jean-Luc Gaudiot
  • IEEE Transactions on Computers, Vol. 62, No. 1, pp. 6-15, Jan. 2013

Conference Papers

  • Effcient Descriptor-Filtering Algorithm for Speeded Up Robust Features Matching
  • Minwoo Kim, Deokho Kim, Kyungah Kim, and Won Woo Ro
  • The 5th FTRA International Conference on Computer Science and its Applications
  • (CSA-13)
  • Danang, Vietnam, Dec. 18 - 21, 2013
  • XSD: Accelerating MapReduce by Harnessing the GPU inside an SSD
  • Benjamin Y. Cho, Won Seob Jeong, Doohwan Oh, and Won Woo Ro
  • The 1st Workshop on Near-Data Processing. In conjunction with the MICRO-46
  • (WoNDP 2013)
  • Davis, USA, Dec. 8, 2013
  • Mark-Sharing: A Parallel Garbage Collection Algorithm for Low Synchronization Overhead
  • Hyunkyu Park, Changmin Lee, Seung Hun Kim, Won Woo Ro and Jean-Luc Gaudiot
  • The 19th IEEE International Conference on Parallel and Distributed Systems
  • (ICPADS 2013)
  • Seoul, Korea, Dec. 15 - 18, 2013
  • Leveraging Effectiveness of Contention Management for Transactional Memory Systems with Performance Monitoring
  • Keunsoo Kim, Seung Hun Kim, Sang-min Lee, and Won Woo Ro
  • The 28th International Technical Conference on Circuits/Systems, Computer and Communications
  • (ITC-CSCC 2013)
  • Yeosu, Korea, Jun. 30 - July 3, 2013
  • MGMR: Multi-GPU Based MapReduce
  • Yi Chen, Zhi Qiao, Hai Jiang, Kuan-Ching Li, Won Woo Ro
  • The 8th International Conference on Grid and Pervasive Computing
  • (GPC 2013)
  • Seoul, Korea, May. 9 - 11, 2013
  • Parallel GPU Architecture Simulation Framework Exploiting Work Allocation Unit Parallelism
  • Sangpil Lee and Won Woo Ro
  • The 2013 IEEE International Symposium on Performance Analysis of Systems and Software
  • (ISPASS 2013)
  • Austin, TX, USA, Apr. 21 - 23, 2013
  • Directory Centralized Ring-based Interconnection for Multi-Core Systems
  • Myung Kuk Yoon, Sangpil Lee, Deokho Kim, and Won Woo Ro
  • The 12th International Conference on Electronics, Information and Communication
  • (ICEIC 2013)
  • Bali, Indonesia, Jan. 30 - Feb. 2, 2013
  • Parallel Garbage Collection with Transactional Memory
  • Hyunkyu Park, Changmin Lee, and Won Woo Ro
  • The 12th International Conference on Electronics, Information and Communication
  • (ICEIC 2013)
  • Bali, Indonesia, Jan. 30 - Feb. 2, 2013

2012

Journal Papers

  • Multi-Threading and Suffix Grouping on Massive Multiple Pattern Matching Algorithm
  • Doohwan Oh and Won Woo Ro
  • The Computer Journal, Vol. 55, No. 11, pp. 1331-1346, Nov. 2012
  • Offloading of Media Transcoding for High-Quality Multimedia Services
  • Seung Hun Kim, Keunsoo Kim, Changmin Lee, and Won Woo Ro
  • IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, pp. 691-699, May 2012
  • Design of a Power-Efficient Parallel Pipelined Bloom Filter
  • Deokho Kim, Doohwan Oh, and Won Woo Ro
  • Electronics Letters, Vol. 48, No. 7, pp. 367-369, Mar. 2012
  • Reconfigurable and Parallelized Network Coding Decoder for VANETs
  • Sunwoo Kim and Won Woo Ro
  • Mobile Information Systems, Vol. 8, No. 1, pp. 45-59, Feb. 2012
  • Accelerated Network Coding with Dynamic Stream Decomposition on Graphics Processing Unit
  • Sangpil Lee and Won Woo Ro
  • The Computer Journal, Vol. 55, No. 1, pp. 21-34, Jan. 2012

Conference Papers

  • On Migration and Consolidation of VMs in Hybrid CPU-GPU Environments
  • Kuan-Ching Li, Keunsoo Kim, Won Woo Ro, Tien-Hsiung Weng, Che-Lun Hung, Chen-Hao Ku, Albert Cohen, and Jean-Luc Gaudiot
  • International Conference on Intelligent Technologies and Engineering Systems
  • (ICITES 2012) - LNEE
  • Changhua, Taiwan, Dec. 13-15, 2012
  • Conflict Avoidance Scheduling using Grouping List for Transactional Memory
  • Dongmin Choi, Seung Hun Kim, and Won Woo Ro
  • The 17th International Workshop on High-Level Parallel Programming Models and Supportive Environments
  • (HIPS-17)
  • Shanghai, China, May 21, 2012
  • Cooperative Heterogeneous Computing for Parallel Processing on CPU/GPU Hybrids
  • Changmin Lee, Won Woo Ro, and Jean-Luc Gaudiot
  • The 16th Workshop on Interaction between Compilers and Computer Architectures
  • (INTERACT-16)
  • New Orleans, USA, Feb. 25 - 29, 2012
  • Matrix Data Layout Optimization for Multi-Core Architectures
  • Minwoo Kim, and Won Woo Ro
  • The 11th International Conference on Electronics, Information and Communication
  • (ICEIC 2012)
  • Jeongseon, Korea, Feb. 1 - 3, 2012
  • The Effect of Concurrency Control in Transactional Memory Systems
  • Seung Hun Kim, Dongmin Choi, and Won Woo Ro
  • The 11th International Conference on Electronics, Information and Communication
  • (ICEIC 2012)
  • Jeongseon, Korea, Feb. 1 - 3, 2012
  • Adaptive Replacement Cache in Transactional Memory
  • Dongmin Choi, Hyunkyu Park, Seung Hun Kim, and Won Woo Ro
  • The 11th International Conference on Electronics, Information and Communication
  • (ICEIC 2012)
  • Jeongseon, Korea, Feb. 1 - 3, 2012

2011

Journal Papers

  • A Novel Sequential Tree Algorithm Based on Scoreboard for MPI Broadcast Communication
  • Won-young Chung, Jae-won Park, Seung-Woo Lee, Won Woo Ro, and Yong-surk Lee
  • IEICE Transactions on Information and Systems, Vol 94, No. 12, pp. 2523-2527, December. 2011
  • Network Coding on Heterogeneous Multi-Core Processors for Wireless Sensor Networks
  • Deokho Kim, Karam Park, and Won W. Ro
  • Sensors, Vol 11, No. 8, pp. 7908-7933, Aug. 2011
  • A Low-Cost Standard Mode MPI Hardware Unit for Embedded MPSoC
  • Won-Young Chung, Ha-Young Jeong, Won W. Ro, and Yong-Surk Lee
  • IEICE Transactions on Information and Systems, Vol. E94-D, No.7, pp. 1497-1501, July 2011

Conference Papers

  • Parallel Transpose of Matrix Multiplication Based on the Tiling Algorithm
  • Minwoo Kim, Yong J. Jang, and Won W. Ro
  • The 54th IEEE International Midwest Symposium on Circuits and Systems
  • (MWSCAS 2011)
  • Seoul, Korea, Aug. 7 - 10, 2011
  • Performance Evaluation of Adaptive Progressive Network Coding
  • Deokho Kim, Karam Park, and Won W. Ro
  • The 54th IEEE International Midwest Symposium on Circuits and Systems
  • (MWSCAS 2011)
  • Seoul, Korea, Aug. 7 - 10, 2011

2010

Journal Papers

  • Multithreaded Pattern Matching Algorithm with Data Rearrangement
  • Doohwan Oh, Seung Hun Kim, and Won W. Ro
  • IEICE Electronics Express, Vol. 7, No. 20, pp. 1520-1526, Oct. 2010
  • On Improving Parallelized Network Coding with Dynamic Partitioning
  • Karam Park, Joon-Sang Park, and Won W. Ro
  • IEEE Transactions on Parallel and Distributed Systems, Vol. 21, No. 11, pp. 1547-1560, Nov. 2010
  • Hardware Implementation of a Tessellation Accelerator for the OpenVG Standard
  • Seung Hun Kim, Yunho Oh, Karam Park, and Won W. Ro
  • IEICE Electronics Express, Vol. 7, No. 6, pp. 440-446, Mar. 2010

Conference Papers

  • Development of Virtual CUDA Systems of Parallel Processing on CPU and GPGPU
  • Doohwan Oh, Sangpil Lee, Deokho Kim, Changmin Lee, and Won W. Ro
  • Workshop on Micro Architectural Support for Virtualization, Data Center Computing, and Clouds In Conjunction with MICRO 2010
  • (MASVDC Workshop 2010)
  • Atlanta, USA, Dec. 5, 2010
  • Implementing FFT using SPMD style of OpenMP
  • Tien-Hsiung Weng, Sheng-Wei Huang, Won Woo Ro, and Kuan-Ching Li
  • In Proc. of the 6th International Conference on Networked Computing and Advanced Information Management
  • (NCM 2010)
  • Seoul, Korea, Aug. 16 - 18, 2010
  • Multi-Threaded Filtered BackProjection Algorithm on Multi-Core Processors
  • Yun H. Oh and Won W. Ro
  • The 10th International Conference on Electronics, Information, and Communication
  • (ICEIC 2010)
  • Cebu, Philippines, Jun. 30 - July 2, 2010
  • Accelerated Reconstruction Using Parallel Computing for Spiral Spectroscopic Imaging
  • Dong H. Kim, Yun H. Oh, Yun H. Nam, M. Gu, and Won W. Ro
  • In Proc. of 2010 International Society for Magnetic Resonance in Medicine Annual Meeting
  • (2010 ISMRM Annual Meeting)
  • Stockholm, Sweden, May 1 - 7, 2010
  • FPGA Implementation of Highly Parallelized Decoder Logic for Network Coding
  • Sunwoo Kim and Won W. Ro
  • In Proc. of Eighteenth ACM/SIGDA International Symposium on Field-Programmable Gate Arrays
  • (FPGA 2010)
  • Monterey, USA, Feb. 21 - 23, 2010

2009

Journal Papers

  • A Complexity-Effective Microprocessor Design with Decoupled Dispatch Queues and Prefetching
  • Won W. Ro and Jean-Luc Gaudiot
  • Parallel Computing, Vol. 35, No. 5, pp. 255-268, May 2009

Conference Papers

  • Evaluation of Cache Coherence Protocols on Multi-Core Systems with Linear Workloads
  • Yong J. Jang and Won W. Ro
  • In Proc. of 2009 International Colloquium on Computing, Communication, Control, and Management
  • (CCCM 2009)
  • Sanya, China, Aug. 8 - 9, 2009
  • Comparing Open Source Web Services: gSoap and AXIS
  • Jongwook Woo and Won W. Ro
  • In Proc. of the 24th International Technical Conference on Circuits/Systems, Computers and Communications
  • (ITC-CSCC 2009)
  • Jeju Island, Korea, July 5 - 8, 2009
  • Efficient Parallelized Network Coding for P2P File Sharing Applications
  • Karam Park, Joon-Sang Park, and Won W. Ro
  • In Proc. of the 4th International Conference on Grid and Pervasive Computing
  • (GPC 2009)
  • Geneva, Switcherland, May 4 - 8, 2009
  • Fully Pipelined Hardware Implementation of 128-bit SEED Block Cipher Algorithm
  • Jaeyoung Yi, Karam Park, Joonseok Park, and Won W. Ro
  • In Proc. of the 5th International Workshop on Applied Reconfigurable Computing
  • (ARC 2009)
  • Karlsruhe, Germany, Mar. 16 - 18, 2009

Book Chapters

  • Programmability and Scalability on Multi-Core Architectures
  • Jaeyoung Yi, Yong J. Jang, Doohwan Oh, and Won W. Ro
  • Chapter in "Handbook of Research on Scalable Computing Technologies", edited by Kuan-Ching Li, Ching-Hsien Hsu, Laurence Tianruo Yang, Jack Dongarra, and Hans Zima, Information Science Reference, 2009

2008

Journal Papers

  • Efficient Peer-to-Peer File Sharing Using Network Coding in MANET
  • Uichin Lee, Joon-Sang Park, Seung-Hoon Lee, Won W. Ro, Giovanni Pau, and Mario Gerla
  • Journal of Communications and Networks, Vol. 10, No. 4, Dec. 2008
  • A Low-Complexity Microprocessor Design with Speculative Pre-Execution
  • Won W. Ro and Jean-Luc Gaudiot
  • Journal of Systems Architecture, Vol. 54, No. 12, pp. 1101-1112, Dec. 2008
  • Performance Evaluation of Programming Models for SMP-Based Clusters
  • Myungho Lee, Neungsoo Park, Won W. Ro, and Kuan-Ching Li
  • Journal of the Chinese Institute of Engineers, Vol. 31, No. 7, pp. 1181-1188, Dec. 2008
  • Simultaneous Thin-Thread Processors for Low-Power Embedded Systems
  • Won W. Ro, Jaeyoung Yi, Joon-Sang Park, and Joonseok Park
  • IEICE Electronics Express, Vol. 5, No. 19, pp. 802-808, Oct. 2008
  • Delay Analysis of Car-to-Car Reliable Data Delivery Strategies Based on Data Mulling with Network Coding
  • Joon-Sang Park, Uichin Lee, Soon Young Oh, Mario Gerla, Desmond Siumen Lun, Won W. Ro, and Joonseok Park
  • IEICE Transactions on Information and Systems, Vol. E91-D, No. 10, Oct. 2008

Conference Papers

  • Parallel Algorithms for Steiner Tree Problem
  • Joon-Sang Park, Won W. Ro, Handuck Lee, and Neungsoo Park
  • In Proc. of the 3rd International Conference on Convergence and Hybrid Information Technology
  • (ICHIT 2008)
  • Busan, Korea, Nov. 11 - 13, 2008

2006

Journal Papers

  • Design and Evaluation of a Hierarchical Decoupled Architecture
  • Won W. Ro, Stephen P. Crago, Alvin M. Despain, and Jean-Luc Gaudiot
  • Journal of Supercomputing, Springer, Vol. 38, No. 3, pp. 237-259, Dec. 2006
  • Speculative Pre-Execution Assisted by Compiler (SPEAR)
  • Won W. Ro and Jean-Luc Gaudiot
  • Journal of Parallel and Distributed Computing, Elsevier, Vol. 66, No. 8, pp. 1076-1089, Aug. 2006

Conference Papers

  • Design and Effectiveness of Small-Sized Decoupled Dispatch Queues
  • Won W. Ro and Jean-Luc Gaudiot
  • In Proc. of European Conference on Parallel Computing - LNCS
  • (EURO-PAR 2006)
  • Dresden, Germany, Aug. 29 - Sep. 1, 2006

2005

Conference Papers

  • A Low-Complexity Issue Queue Design with Speculative Pre-Execution
  • Won W. Ro and Jean-Luc Gaudiot
  • In Proc. of the 12th International Conference on High Performance Computing
  • (HiPC 2005)
  • Goa, India, Dec. 18 - 21, 2005

Book Chapters

  • Techniques to Improve Performance Beyond Pipelining: Superpipelining, Superscalar, and VLIW
  • Jean-Luc Gaudiot, Jung-Yup Kang, and Won Woo Ro
  • Chapter in "Computer Architecture", a volume of "Advance in Computers", edited by Ali R.Hurson, Elsevier, 2005

2004

Conference Papers

  • SPEAR: A Hybrid Model for Speculative Pre-Execution
  • Won W. Ro and Jean-Luc Gaudiot
  • In Proc. of the 18th International Parallel and Distributed Processing Symposium
  • (IPDPS 2004)
  • Santa Fe, New Mexico, 2004

2003

Conference Papers

  • HiDISC: A Decoupled Architecture for Data-Intensive Applications
  • Won W. Ro, Jean-Luc Gaudiot, Stephen P. Crago, and Alvin M. Despain
  • In Proc. of the 17th International Parallel and Distributed Processing Symposium
  • (IPDPS 2003)
  • Nice, France, Apr. 22 - 26, 2003
  • Compiler Support for Dynamic Speculative Pre-Execution
  • Won W. Ro and Jean-Luc Gaudiot
  • In Proc. of the 7th Annual Workshop on Interaction between Compilers and Computer Architectures
  • (INTERACT-7) in conjunction with HPCA-9
  • Anaheim, California, Feb. 8, 2003

2000

Conference Papers

  • Memory Latency: to Tolerate or to Reduce?
  • Amol Bakshi, Jean-Luc Gaudiot, Wen-Yen Lin, Manil Makhija, Viktor K. Prasanna, Wonwoo Ro, and Chulho Shin
  • In Proc. of the 12th Symposium on Computer Architecture and High Performance Computing
  • (SBAC-PAD'00)
  • Sao Pedro, Brazil, Oct. 24 - 27, 2000
  • A High-Performance, Hierarchical Decoupled Architecture
  • Stephen P. Crago, Alvin Despain, Jean-Luc Gaudiot, Manil Makhija, Wonwoo Ro, and Apoorv Srivastava
  • In Proc. of the Memory access Decoupling for superscalar and multiple issue Architectures
  • (MEDEA) Workshop in conjunction with PACT 2000
  • Philadelphia, Oct. 15, 2000
  • A Reliable Cluster Computing with a New Checkpointing RAID-x Architecture
  • Kai Hwang, Hai Jin, Roy Ho, and Wonwoo Ro
  • In Proc. of the 9th Heterogeneous Computing Workshop
  • (HCW)
  • Cancun, Mexico, May 1, 2000