| 
   
  | 
  
   36th
  Annual International Symposium on Microarchitecture   | 
  
   
  | 
 
Sponsored by IEEE TC-MARCH
 and ACM
SIGMICRO 
Micro 36 Program on CD
Important Notes:
  Sessions
  
   | 
 
 
| 
   Session
  8: Dataflow, Data Parallel, and Clustered Architectures  | 
 
  
  Keynote Speak 1
   | 
 
 
| 
   Microarchitecture
  on the MOSFET Diet Kerry Bernstein IBM T. J. Watson Research Center    | 
 
 
  Session 1: Voltage Scaling &
  Transient Faults
   | 
 
 
| 
   Razor: A Low-Power Pipeline Based on Circuit-Level Timing Speculation Dan Ernst, Nam Sung Kim, Sanjay Pant, Shidhartha Das, Rajeev Rao, Toan
  Pham, Conrad Ziesler, David Blaauw, Todd Austin, University of Michigan;
  Krisztian Flautner, ARM Ltd.; Trevor Mudge, University of Michigan.   VSV: L2-Miss-Driven Variable Supply-Voltage Scaling for Low Power Hai Li, Chen-Yong Cher, T. N. Vijaykumar, Kaushik Roy, Purdue.    Shubhendu S. Mukherjee, Christopher T. Weaver, Joel Emer, Intel
  Corporation; Steven K. Reinhardt, Intel Corporation and University
  of Michigan; Todd Austin, University of Michigan.    | 
 
 
  Session 2: Cache Design 
   | 
 
 
| 
  
   Bradford M. Beckmann, David A. Wood, University of
  Wisconsin-Madison.   Distance Associativity for High-Performance Energy-Efficient, Non-Uniform
  Cache Architectures Zeshan Chishti, Michael D. Powell, T. N. Vijaykumar, Purdue.     Near-Optimal Precharging in High-Performance Nanoscale CMOS Caches Se-Hyun Yang, Babak Falsafi, Carnegie Mellon University.     | 
 
 
  Session 3: Power and Energy
  Efficient Architectures
   | 
 
 
| 
   Single-ISA Heterogeneous Multi-Core Architectures: The Potential for
  Processor Power Reduction Rakesh Kumar, UCSD; Keith Farkas, Norman Jouppi, Partha
  Ranganathan, HP Labs; Dean Tullsen, UCSD.   Run-time Power Monitoring in High-End Processors: Methodology and
  Empirical Data Canturk Isci, Margaret Martonosi, Princeton University.     Power-driven Design of Router Microarchitectures in On-chip Networks Hangsheng Wang, Li-Shiuan Peh, Sharad Malik, Department of
  Electrical Engineering - Princeton University.   Optimum Power/Performance Pipeline Depth Allan Hartstein, Thomas R. Puzak, IBM T. J. Watson Research Center.    | 
 
 
  Session 4: Application-Specific
  Optimization and Analysis
   | 
 
 
| 
   Processor Acceleration through Automated Instruction Set
  Customization   Nathan Clark, Hongtao Zhong, Scott Mahlke, University of Michigan.   The Reconfigurable Streaming Vector Processor (RSVPä) Silviu Ciricescu, Ray Essick, Brian Lucas, Phil May, Kent Moat, Jim
  Norris, Michael Schuette, Motorola Labs; Ali Saidi, The Mitre
  Corporation.   Scaling and Characterizing Database Workloads: Bridging the Gap between
  Research and Practice Richard Hankins, Trung Diep, Murali Annavaram, Intel; Brian
  Hirano, Harald Eri, Oracle; Hubert Nueckel, John Shen, Intel.     | 
 
 
  Keynote Speak 2
   | 
 
 
| 
  
   Micheal Schlansker  Hewlett-Packard Laboratories     | 
 
 
  Session 5: Dynamic Optimization
  Systems
   | 
 
 
| 
   Generational Cache Management of Code Traces in Dynamic Optimization
  Systems Kim Hazelwood, Michael D. Smith, Harvard University.   The Performance of Runtime Data Cache Prefetching in a Dynamic
  Optimization System Jiwei Lu, Howard Chen, Rao Fu, Wei-Chung Hsu, Bobbie Othmer, Pen-Chung
  Yew, Univ. of Minnesota - Twin Cities; Dong-Yuan Chen, Intel.   IA-32 Execution Layer: a two phase dynamic translator designed to
  support IA-32 applications on Itaniumâ-based systems Leonid Baraz, Tevi Devor, Orna Etzion, Shalom Goldenberg, Alex
  Skaletsky, Yun Wang, Yigal Zemach, Intel.     | 
 
 
  Session 6: Dynamic Program Analysis
  and Optimization
   | 
 
 
| 
   LLVA: A Low-level Virtual Instruction Set Architecture Vikram Adve, Chris Lattner, Michael Brukman, Anand Shukla, Brian
  Gaeke, University of Illinois at Urbana-Champaign.   Comparing Program Phase Detection Techniques Ashutosh S. Dhodapkar, James E. Smith, University of
  Wisconsin-Madison.    Using Interaction Costs for Microarchitectural Bottleneck Analysis Brian A. Fields, Rastislav Bodik, University of
  California-Berkeley; Mark D. Hill, University of Wisconsin-Madison;
  Chris J. Newburn, Intel.     | 
 
 
  Session 7: Branch, Value, and
  Scheduling Optimizations
   | 
 
 
| 
   Fast Path-Based Neural Branch Prediction Daniel A. Jiménez, Rutgers University.   Hardware Support for Control Transfers in Code Caches Ho-Seop Kim, James E. Smith, University of Wisconsin-Madison.   Exploiting Value Locality in Physical Register Files Saisanthosh Balakrishnan, Gurindar S. Sohi, University of
  Wisconsin-Madison.   Macro-op Scheduling: Relaxing Scheduling Loop Constraints Ilhyun Kim, Mikko H. Lipasti, University of Wisconsin-Madison.    | 
 
 
  Session 8:  Dataflow, Data Parallel, and Clustered Architectures
   | 
 
 
| 
  
   Steven Swanson, Ken Michelson, Andrew Schwerin, Mark Oskin, University of Washington.   Universal Mechanisms for Data Parallel Architectures Karthikeyan
  Sankaralingam, Stephen W. Keckler, William R. Mark, Doug Burger, University
  of Texas - Austin.   Flexible Compiler-Managed L0 Buffers for Clustered VLIW Processors Enric Gibert, UPC - Barcelona; Jesús Sánchez, Antonio González, UPC - Barcelona and Intel Labs - Barcelona.   Instruction Replication for Clustered Architectures   Alex Aletŕ, Josep M.
  Codina, UPC - Barcelona; Antonio González, UPC - Barcelona and Intel
  Labs - Barcelona; David Kaeli, Northeastern University.     | 
 
 
  Session 9: Secure and Network
  Processors 
   | 
 
 
| 
   Efficient Memory Integrity Verification and Encryption for Secure
  Processor G. Edward Suh, Dwaine Clarke, Blaise Gassend, MIT; Marten van
  Dijk, Philips; Srinivas Devadas, MIT.   Fast Secure Processor for Inhibiting Software Piracy and Tampering Jun Yang, University of California-Riverside;Youtao Zhang, University
  of Texas-Dallas; Lan Gao, University of California-Riverside.   IPStash: A Power-Efficient Memory Architecture for IP Lookup Stefanos Kaxiras, Agere Systems; Georgios Keramidas, Department
  of ECE - Univ. of Patras.   Design and Implementation of High-Performance Memory Systems for Future
  Packet Buffers Jorge Garcia, Jesus Corbal, Llorenç Cerdŕ, Mateo Valero, UPC -
  Barcelona.     | 
 
 
  Session 10: Scaling Design 
   | 
 
 
| 
   Beating In-Order Stalls with "Flea-Flicker" Two-Pass
  Pipelining Ronald D. Barnes, Erik M. Nystrom, John W. Sias, Sanjay J. Patel,
  Nacho Navarro, Wen-mei W. Hwu, University of Illinois at Urbana-Champaign.   Scalable Hardware Memory Disambiguation for High ILP Processors Simha Sethumadhavan, Department of Computer Sciences - UT Austin; Rajagopalan
  Desikan, Department of Electrical and Computer Engineering - UT Austin; Doug
  Burger, Charles R. Moore, Stephen W. Keckler, Department of Computer
  Sciences - UT Austin.    Reducing Design Complexity of the Load/Store Queue Il Park, Chong Liang Ooi, T. N. Vijaykumar, Purdue.    Checkpoint Processing and Recovery: Towards Scalable Large Instruction
  Window Processors Haitham Akkary, Ravi Rajwar, Srikanth T. Srinivasan, Intel.     | 
 
| 
   General
  Chair  | 
  
   Bill
  Mangione-Smith  | 
 
| 
   Program
  Chair  | 
  
   Brad
  Calder  | 
 
| 
   Local
  Arrangements Chair  | 
  
   Dean
  Tullsen, UC San Diego   | 
 
| 
   Workshop
  and Tutorial Chair  | 
  
   Glenn
  Reinman, UCLA   | 
 
| 
   Publicity
  Chair  | 
  
   Jason
  Fritts, Washington U.   | 
 
| 
   Publication
  Chair  | 
  
   Gokhan
  Memik, Northwestern U.  | 
 
| 
   Registration
  Chair  | 
  
   Roger
  Bringmann, Agere Systems   | 
 
| 
   Steering
  Committee  | 
  
   Richard
  Belgard, Consultant, Chariman   | 
 
| 
   Program
  Committee  | 
  
   Saman
  Amarasinghe, MIT  |