Skip to content

Distributed Systems

Official Documentation

  • Apache Hadoop Documentation
  • Link
  • Apache Kafka Documentation
  • Link
  • Kubernetes Documentation
  • Link
  • Docker Documentation
  • Link

Research Papers

  • "The Google File System" by Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung (2003)
  • Link
  • "MapReduce: Simplified Data Processing on Large Clusters" by Jeffrey Dean and Sanjay Ghemawat (2004)
  • Link
  • "Dynamo: Amazon's Highly Available Key-value Store" by Giuseppe DeCandia et al. (2007)
  • Link
  • "ZooKeeper: Wait-free coordination for Internet-scale systems" by Patrick Hunt et al. (2010)
  • Link

Additional Online Resources

  • Distributed Systems for Fun and Profit
  • Link
  • Designing Data-Intensive Applications by Martin Kleppmann
  • Link
  • MIT 6.824: Distributed Systems
  • Link
  • Awesome Distributed Systems
  • Link

Books

  • "Designing Distributed Systems" by Brendan Burns
  • ISBN: 1491983647
  • A practical guide to designing and implementing distributed systems using Kubernetes and other open source tools.
  • "Designing Data-Intensive Applications" by Martin Kleppmann
  • ISBN: 1449373321
  • Explores the principles, algorithms, and trade-offs of data systems, with a focus on distributed architectures.
  • "Distributed Systems: Principles and Paradigms" by Andrew S. Tanenbaum and Maarten van Steen
  • ISBN: 153028175X
  • A comprehensive introduction to the principles and paradigms underlying distributed systems.
  • "Building Microservices" by Sam Newman
  • ISBN: 1492034029
  • Provides practical advice on how to design and implement distributed systems using microservices architecture.

Standards

  • ISO/IEC/IEEE 60559:2020 Information technology — Microprocessor Systems — Floating-Point arithmetic
  • Relevant for ensuring consistency in floating-point operations across distributed systems
  • ISO/IEC 10746 Information technology — Open Distributed Processing
  • Provides a framework for the standardization of open distributed processing

Online Courses

  • Coursera: Cloud Computing Specialization by University of Illinois
  • Link
  • edX: Reliable Distributed Algorithms by KTH Royal Institute of Technology
  • Link
  • Udacity: Intro to Parallel Programming
  • Link