Computing Platforms

Content

The seminar will cover core concepts and ideas in the general area of computer systems, ranging from software and hardware architectures to system design for operating systems, data processing systems, and distributed systems. The focus will be on fundamental ideas that apply across systems and application areas but with an emphasis on those ideas that apply to cloud platforms and hardware accelerators.

Format

The seminar will consist on student presentations based on a list of papers that will be provided at the beginning of the course. Presentations will be arranged in slots of 20 minutes talk plus 10 minutes questions. Grades will be assigned based on quality of the presentation, coverage of the topic including material not in the original papers, participation during the seminar, and ability to understand, present, and criticize the underlying technology.

Seminar Hours

Mondays, 4-6pm, at CHN D 44. The first seminar will be on February 19th.

Lecturers

  • Prof. Gustavo Alonso
  • Dr. Michael Giardino

Teaching Assistant

  • Michael Wawrzoniak ()

Schedule

Papers

You may need to click on the links from within the ETH network (via VPN) to get the full-text papers.

1. Michael Armbrust, Armando Fox, Rean Griffith, Anthony D. Joseph, Randy Katz, Andy Konwinski, Gunho Lee, David Patterson, Ariel Rabkin, Ion Stoica, Matei Zaharia (2010) A View of Cloud Computing. In: CACM. [external pageLink]

2. Barroso, L., Marty, M., Patterson, D., & Ranganathan, P. (2017). Attack of the killer microseconds. In: CACM. [external pageLink]

3. Primorac, M., Bugnion, E., & Argyraki, K. (2017). How to measure the killer microsecond. In: CCR. [external pageLink]

4. Delimitrou, C., & Kozyrakis, C. (2018). Amdahl’s law for tail latency: Queueing theoretic models can guide design trade-​offs in systems targeting tail latency, not just average performance. In: CACM. [external pageLink]

5. Huaicheng Li , Daniel S. Berger, Stanko Novakovic , Lisa Hsu , Dan Ernst , Pantea Zardoshti, Monish Shah , Samir Rajadnya , Scott Lee , Ishwar Agarwal , Mark D. Hill , Marcus Fontoura , Ricardo Bianchini. (2023) Pond: CXL-Based Memory Pooling Systems for Cloud Platforms. In: ASPLOS. [external pageLink

6. Hasan Al Maruf, Hao Wang, Abhishek Dhanotia, Johannes Weiner, Niket Agarwal, Pallab Bhattacharya, Chris Petersen, Mosharaf Chowdhury, Shobhit Kanaujia, Prakash Chauhan. (2023) TPP: Transparent Page Placement for CXL-Enabled Tiered-Memory. In: ASPLOS. [external pageLink]

7. Z Guo, Y Shan, X Luo, Y Huang, Y Zhang. Clio: A Hardware-Software Co-Designed Disaggregated Memory System. ASPLOS, 2022 [external pageLink]

8. Q Zhang, P Bernstein, D Berger, B Chandramouli. Redy: Remote Dynamic Memory Cache. VLDB, 2022. [external pageLink]

9. Shafer, J., Rixner, S., & Cox, A. L. (2010). The Hadoop distributed file system: Balancing portability and performance. In: ISPASS. [external pageLink1][external pageLink2]

10. Burrows, M. (2006). The Chubby lock service for loosely-​coupled distributed systems. In: OSDI. [external pageLink]

11. Hunt, P., Konar, M., Junqueira, F. P., & Reed, B. (2010). ZooKeeper: Wait-​free coordination for internet-​scale systems. In: USENIX ATC. [external pageLink]

12. DeCandia, G., Hastorun, D., Jampani, M., Kakulapati, et al. (2007). Dynamo: Amazon’s Highly Available Key-​value Store. In: SIGOPS. [external pageLink]

13. Corbett, J. C., Dean, J., Epstein, M., et al. (2012). Spanner: Google’s Globally-​Distributed Database. In: OSDI. [external pageLink]

14. Bacon, D. F., Bales, N., Bruno, N., et al. (2017). Spanner: Becoming a SQL system. In: SIGMOD. [external pageLink]

15. Armbrust, M., Ghodsi, A., Zaharia, M., et al. (2015). Spark SQL: Relational Data Processing in Spark. In: SIGMOD. [external pageLink]

16. Chen, G. J., Wiener, J. L., Iyer, S., Jaiswa, et al. (2016). Realtime Data Processing at Facebook. In: SIGMOD. [external pageLink]

17. Verbitski et al (2017) Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases. In: SIGMOD '17. [external pageLink]

18. Wang et al. (2023) PolarDB-IMCI: A Cloud-Native HTAP Database System at Alibaba. In: SIGMOD'23. [external pageLink]

19. Lasse Thostrup, Gloria Doci, Nils Boeschen, Manisha Luthra, Carsten Binnig. Distributed GPU Joins on Fast RDMA-capable Networks. SIGMOD 2023 [external pageLink]

20. Fang, Yuanwei and Zou, Chen and Chien, Andrew A. Accelerating raw data analysis with the ACCORDA software and hardware architecture. VLDB 2019 [external pageLink]

21. Harshit Saokar, Soteris Demetriou, Nick Magerko, Max Kontorovich, Josh Kirstein, and Margot Leibold, Dimitrios Skarlatos, Hitesh Khandelwal and Chunqiang Tang. (2023) ServiceRouter: Hyperscale and Minimal Cost Service Mesh at Meta. In OSDI'23. [external pageLink]

22. Johann Schleier-Smith, Vikram Sreekanti, Anurag Khandelwal, Joao Carreira, Neeraja J. Yadwadkar, Raluca Ada Popa, Joseph E. Gonzalez, Ion Stoica, David A. Patterson. What Serverless Computing Is and Should Become: The Next Phase of Cloud Computing. In: CACM. [external pageLink]

23. Saxena et al. (2023). The Story of AWS Glue. In: VLDB'23. [external pageLink]

24. Shankar, V., Krauth, K., Vodrahalli, K., Pu, Q., et al. (2020). Serverless linear algebra. In: SoCC. [external pageLink]

25. Klimovic, A., Wang, Y., Stuedi, P., et al. (2018). Pocket: Elastic Ephemeral Storage for Serverless Analytics. In: OSDI. [external pageLink]

26. S Fouladi, F Romero et al. From Laptop to Lambda: Outsourcing Everyday Jobs to Thousands of Transient Functional Containers. ATC, 2019. [external pageLink]

27. Ao Wang, Jingyuan Zhang, Xiaolong Ma, Ali Anwar, Lukas Rupprecht, Dimitrios Skourtis, Vasily Tarasov, Feng Yan, Yue Cheng (2020). INFINICACHE: exploiting ephemeral serverless functions to build a cost-effective memory cache. In FAST'20. [external pageLink]

 

Invited Talks

TBD

Presentations Tips

 

JavaScript has been disabled in your browser