CK Luk's Publications:

  1. Minjang Kim, Nagesh B. Lakshminarayana, Hyesoon Kim, and Chi-Keung Luk.SD3: An Efficient Dynamic Data-Dependence Profiling Mechanism. To appear in IEEE Transactions on Computers.
  2. Yuan Tang, Rezaul Alam Chowdhury, Bradley C. Kuszmaul, Chi-Keung Luk, and Charles E. Leiserson.The Pochoir Stencil Compiler. In Proceedings of the 23rd ACM Symposium on Parallelism in Algorithms and Architectures (SPAA'2011), June 2011.
  3. Yuan Tang, Rezaul Alam Chowdhury, Chi-Keung Luk, and Charles E. Leiserson. Coding Stencil Computations Using the Pochoir Stencil-Specification Language. In HotPar'2011, June 2011.
  4. Chi-Keung Luk, Ryan Newton, William Hasenplaugh, Mark Hampton, and Geoff Lowney. A Synergetic Approach to Throughput Computing on x86-based Multicore Desktops. In IEEE Software Special Issue on Multiprocessor Desktops, Jan/Feb 2011.
  5. Minjang Kim, Hyesoon Kim, and Chi-Keung Luk. SD3: A Scalable Approach to Dynamic Data-Dependence Profiling. In Proceedings of the 2010 ACM/IEEE International Symposium on Microarchitecture (MICRO), December 2010.
  6. Minjang Kim, Hyesoon Kim, and Chi-Keung Luk. Prospector: A Dynamic Data-Dependence Profiler to Help Parallel Programming. In HotPar'2010, June 2010.
  7. Moshe Bach, Mark Charney, Robert Cohn, Elena Demikhovsky, Tevi Devor, Kim Hazelwood, Aamer Jaleel, Chi-Keung Luk, Gail Lyons, Harish Patil, and Ady Tal. Analyzing Parallel Programs with Pin. In IEEE Computer, 34-41, March 2010.
  8. Chi-Keung Luk, Sunpyo Sun, and Hyesoon Kim. Qilin: Exploiting Parallelism on Heterogeneous Multiprocessors with Adaptive Mapping. In Proceedings of the 2009 ACM/IEEE International Symposium on Microarchitecture (MICRO), December 2009.
  9. Aamer Jaleel, Robert S. Cohn, Chi-Keung Luk, and Bruce Jacob. CMP$im: A Pin-based On-the-Fly Multicore Cache Simulator. In The Fourth Annual Workshop on Modeling, Benchmarking and Simulation (MoBS), co-located with ISCA'2008.
  10. Prashanth Bungale and Chi-Keung Luk. PinOS: A Programmable Framework for Whole-System Dynamic Instrumentation. In Proceedings of the 3rd ACM/USENIX International Conference on Virtual Execution Environment (VEE), pages 137-147, June 2007.
  11. Heidi Pan, Krste Asanovic, Robert Cohn, and Chi-Keung Luk. Controlling Program Execution through Binary Instrumentation. In 1st Workshop on Binary Instrumentation and Application (WBIA), September 2005.
  12. Chi-Keung Luk, Robert Cohn, Robert Muth, Harish Patil, Artur Klauser, Geoff Lowney, Steven Wallace, Vijay Janapa Reddi, and Kim Hazelwood. Pin: Building Customized Program Analysis Tools with Dynamic Instrumentation. In Proceedings of the 2005 ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI), pages 190-200, June 2005.
  13. Chi-Keung Luk. Robert Muth, Harish Patil, Robert Cohn, and Geoff Lowney. Ispike: A Post-Link Optimizer for the Intel Itanium Architecture. In Proceedings of the 2nd International Conference on Code Generation and Optimization (CGO), pages 15-26, March 2004.
  14. Chi-Keung Luk, Robert Muth, Harish Patil, Geoff Lowney, Robert Cohn, and Richard Weiss. Profile-Guided Post-Link Stride Prefetching. In Proceedings of the 2002 International Conference on Supercomputing (ICS), pages 167-178, June 2002.
  15. Joel Emer, Pritpal Ahuja, Bathan Binkert, Roger Espasa, Toni Juan, Artur Klauser, Chi-Keung Luk Srilatha Manne, Shubhendu S. Mukherjee, Harish Patil, and Steven Wallace. Asim: A Performance Model Framework. In IEEE Computer 35(2): 68-76, February 2002.
  16. Dick Flower, Chi-Keung Luk, Robert Cohn, Geoff Lowney, Robert Muth, Harish Patil, and John Shakshober. Kernel Optimizations and Prefetch with the Spike Executable Optimizer. In 4th Workshop on Feedback-Directed and Dynamic Optimization (FDDO-4), December 2001.
  17. Chi-Keung Luk. Tolerating Memory Latency Through Software-Controlled Pre-Execution in Simultaneous Multithreading Processors. In Proceedings of the 28th Annual International Symposium on Computer Architecture (ISCA), pages 40-51, July 2001.
  18. Chi-Keung Luk and Todd C. Mowry. Architectural and Compiler Support for Effective Instruction Prefetching: A Cooperative Approach. In ACM Transactions on Computer Systems, 19(1): 71-109, February 2001.
  19. Chi-Keung Luk. Optimizing the Cache Performance of Non-Numeric Applications. Ph.D. Thesis, Department of Computer Science, University of Toronto, January 2000. (Nominated for the ACM Doctoral Dissertation Award).
  20. Todd C. Mowry and Chi-Keung Luk. Understanding Why Correlation Profiling Improves the Predictability of Data Cache Misses in Nonnumeric Applications. In IEEE Transactions on Computers, 49(4), April 2000.
  21. Chi-Keung Luk and Todd C. Mowry. Memory Forwarding: Enabling Aggressive Layout Optimizations by Guaranteeing the Safety of Data Relocation. In Proceedings of the 26th Annual International Symposium on Computer Architecture (ISCA), pages 88-99, May 1999.
  22. Chi-Keung Luk and Todd C. Mowry. Automatic Compiler-Inserted Prefetching for Pointer-Based Applications. In IEEE Transactions on Computers, 48(2): 134-141, February 1999.
  23. Chi-Keung Luk and Todd C. Mowry. Cooperative Prefetching: Compiler and Hardware Support for Effective Instruction Prefetching in Modern Microprocessors. In Proceedings of the 31st Annual International Symposium on Microarchitecture (MICRO), pages 182-193, December 1998.
  24. Chi-Keung Luk and Todd C. Mowry. Compiler and Hardware Support for Automatic Instruction Prefetching: A Cooperative Approach. Carnegie Mellon University Technical Report CMU-CS-98-140, June 1998.
  25. Todd C. Mowry and Chi-Keung Luk. Predicting Data Cache Misses in Non-Numeric Applications Through Correlation Profiling. In Proceedings of the 30th Annual International Symposium on Microarchitecture (MICRO), pages 314-320, December 1997.
  26. Todd C. Mowry and Chi-Keung Luk. Predicting Data Cache Misses in Non-Numeric Applications Through Correlation Profiling. Carnegie Mellon University Technical Report CMU-CS-97-175, September 1997.
  27. Chi-Keung Luk and Todd C. Mowry. Compiler-Based Prefetching for Recursive Data Structures. In Proceedings of the Seventh International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), pages 222-233, October 1996.
  28. Kam-Wing Ng and Chi-Keung Luk. I+: A Multiparadigm Language for Object-Oriented Declarative Programming. In Computer Languages, vol. 21, no 2, pp. 81-100, 1995.
  29. Kam-Wing Ng and Chi-Keung Luk. A Survey of Languages Integrating Functional, Object-Oriented and Logic Programming. In Journal of Systems Architecture (formerly Microprocessing and Microprogramming), 41:5-36, 1995.
  30. Chi-Keung Luk. Memory Disambiguation for General-Purpose Applications. In Proceedings of the IBM CASCON 95, November 1995.
  31. Chi-Keung Luk. The Design and Implementation of a Multiparadigm Programming Language. Master Thesis, Department of Computer Science, The Chinese University of Hong Kong, July 1993.
  32. Kam-Wing Ng and Chi-Keung Luk. An Integrated Computation Model for Parallel Programming. In Proceedings of IEEE TENCON 93/Beijing, October 1993.
  33. Kam-Wing Ng and Chi-Keung Luk. The Design of a Multiparadigm Programming Language I. In Proceedings of EuroMicro, 1993 (published as a special issue in the Journal of Systems Architecture 37:171-174, 1993).