CS-345 Distributed Systems
Winter 2005

Back

Course Schedule

Here is a list of the topics we will discuss, their scheduled dates and the related material we will cover. Each topic will be introduced with a lecture for which your are required to read the indicated (fraction of a) chapter from your textbook. After the introductory lecture, we will continue with presentations and discussions of related research papers.

The list includes a few papers everybody is responsible for (*) plus additional ones that are only the responsibility of the presenter (+). As a reference, I have also included a list of interesting related papers we will not have time to cover in class.

Schedule:


  1. Introduction: Defining distributed systems and their goals, hardware and software concepts, system models. Design issues and words of wisdom (Jan. 4 & 6, 2005).
    Slides: Welcome to Distributed Systems, Introduction.

    • DSP2 Chap. 1.
    • (*) D. Clifford Neuman. Scale in Distributed Systems. In Casavant, T. and Singhal M. (eds.), Readings in Distributed Computing Systems, Los Alamitos, CA, 1994, pp 463-489.
      Presenter (Date): Instructor (Jan. 6)
    • (+) B. Lampson. Hints for Computer System Design. In Proc. of the ACM Symposium on Operating Systems Principles, Dec. 1983.
    • (+) J. Saltzer, D. Reed and D. Clark. End-to-End Arguments in System Design. ACM Transactions on Computer Systems, 2(4):277-288, November 1984.


  2. Communication: Basics of communication, remote procedure calls and remote object invocation(Jan. 11 & 13, 2005)
    Slides: Communication.


  3. Wide-area distributed experimentation and PlanetLab: Issues evaluating wide-area distributed systems and brief introduction to PlanetLab (TBD) .

    • (*) L. Peterson, T. Anders, D. Culler and T. Roscoe. A Blueprint for Introducing Disruptive Technology into the Internet. In Proc. of the first ACM Workshop on Hot-Topics in Networks (HotNets-I), Oct. 2002.
      Presenter (Date): Stefan Birrer (TBD)
    • (+) A.Bavier, L.Peterson. M. Wawrzoniak, S. Karlin, T. Spalink, T. Roscoe, D. Culler, B. Chun and M. Bowman. Operating System Support for Planetary-Scale Network Services.
    • (+) B. Chun and T. Spalink. Slice Creation and Management. Tech. Report PDN-03-013, PlanetLab, July 2003.



    • Related papers we won't cover:

    • A.Nakao, L. Peterson and A. Bavier. A Routing Underlay for Overlay Networks. In Proc. of the ACM SIGCOMM Conf, August 2003.


  4. Processes: Clients, servers, code migration and software agents (Jan. 18 & 20, 2005)
    Slides: Processes.


  5. Naming: Naming and locating entities, removing unreferenced entities (Jan. 25 & 27, 2005).

    • DSP2 Chap. 4
    • (*) Jaeyeon Jung, Emil Sit, Hari Balakrishnan, and Robert Morris, DNS Performance and the Effectiveness of Caching.ACM SIGCOMM Internet Measurement Workshop, Nov. 2001.
      Presenter (Date): Zhichun Li (Jan. 25)
    • (+) P. Mockapetris and K. Dunlap.Development of the Domain Name System. In Proc. o the ACM SIGCOMM Conf., 1988.
    • (+) P. Danzig, K. Obracza and A. Kumar.An Analysis of Wide-Area Name Server Traffic. In Proc. o the ACM SIGCOMM Conf., 1992.
    • (*) William Adjie-Winoto, Elliot Schwartz, Hari Balakrishnan, Jeremy Lilley, The Design and Implementation of an Intentional Naming System, Proc. of the 17th ACM Symposium on Operating Systems Principles, Dec. 1999.
      Presenter (Date): Yao Zhao (Jan. 27)
    • (+) M. Balazinska, H. Balakrisnan and D. Karger. INS/Twine: A Scalable Peer-to-Peer Architecture for Intentional Resource Discovery. In Proc. of International Conference on Pervasive Computing, 2002.
    • (+) Y. Hu, D. Rodney and P. Druschel. Design and Scalability of NLS, a Scalable Naming and Location Service. In Proc. of IEEE INFOCOM, 2002.



    • Related papers we won't cover:

    • F. Bustamante, P. Widener and K. Schwan. Scalable Directory Services Using Proactivity. In Proc. of Supercomputing, 2002.
    • David R. Cheriton and Timothy P. Mann, Decentralizing a Global Naming Service for Improved Performance and Fault Tulerance, ACM Transactions on Computer Systems 7(2):147:183, 1989.
    • E. Pitoura and G. Samaras Locating Objects in Mobile Computing, in IEEE Transaction on Knowledge and Data Engineering 13(4), Jul/Aug 2001, pp 571-592.
    • M. van Steen, F. Hauc, G. Ballintijn and A. Tanenbaum. Algorithmic Design of the Globe Wide-Area Location Service. The Computer Journal, 41(5), 1998.


  6. Synchronization: Synchronization in distributed systems, logical time, global state, elections and mutual exclusion (Feb 1 & 3, 2005).


  7. Consistency and replication: Data replication, scalability, and consistency (Feb. 8 & 10, 2005).


  8. Fault tolerance: Making distributed systems fault tolerant, reliable and resilient multicasting (Feb. 15 & 17, 2005).

    • DSP2 Chap. 7
    • (*) J. Yin, J-P. Martin, A. Venkataramani, L. Alvisi, M. Dahlin, Separating Agreement from Execution for Byzantine Fault Tolerant Systems, in Proc. of 19th Symposium on Operating Systems Principles, Bolton Landing, Oct. 2003, pp 253-267
      Presenter (Date): Manan Sanghi (Feb. 15)
    • (+) M. Castro, R. Rodrigues and B. Liskov, BASE: Using abstraction to improve fault tolerance, ACM Trans. Comput. Syst. 21(3), 2003, pp 236-269.
    • (+) F. B. Schneider. Implementing Fault-tolerant Services using the State Machine Approach: a Tutorial. ACM Computing Surveys (CSUR), 22(4):299-319, December 1990.
    • (*) S. Birrer, D. Lu, F. Bustamante, Y. Qiao and P. Dinda. FatNemo: Building a Resilient Multi-Source Multicast Fat-Tree. In Proc. of the Ninth International Workshop on Web Content Caching and Distribution, 2004.
      Presenter (Date): Fabian Bustamante (Feb. 17)
    • (+) S. Banerjee, S. Lee, B. Bhattacharjee and A. Srinivasan. Resilient Multicast Using Overlays. In Proc. of ACM SIGMETRICS, 2003.
    • (+) M. Castro, P. Druschel, A-M. Kermarrec, A. Nandi, A. Rowstron and A. Singh. SplitStream: High-Bandwidth Multicast in Cooperative Environments. In Proc. of Symposium on Operating Systems Principles, 2003.


    • Related papers we won't cover:

    • S. Birrer and F. Bustamante. Nemo- Resilient Peer-to-Peer Multicast without the Cost. In Proc. of the 12th Annual Multimedia Computing and Networking Conference, 2005.
    • V. Padmanabhan, H. Wang and P. Chou. Resilient peer-to-peer streaming. In. Proc. of IEEE ICNP, 2003.
    • R. Haskin, Y. Malachi, W. Sawdon, and G. Chan, Recovery Management in Quicksilver, ACM Transactions on Computer Systems, 6(1), Feb. 1998, pp 82-108


  9. Security:Secure communication and authorization in distributed systems (Feb. 22 & 24, 2005).


  10. Review of major distributed systems paradigms: Distributed object systems, Distributed file systems, Distributed document-based systems, and Distributed coordination-based systems (Mar. 1 & 3, 2005).


  11. Project presentations (Mar. 8 & 10, 2005).

  12. Take-home exam (Mar. 14 to 18, 2005)

Last modified: Mon Jan 3 16:43:19 CST 2005