Healing and Self-Repair in Large Scale Distributed Computing Systems

Summary

The project will focus on the development of fault-tolerance mechanisms that allow distributed systems to keep operating under varying conditions.

Supervisor

Professor Albert Y. Zomaya

Research location

Computer Science

Program type

Masters/PhD

Synopsis

As the complexity of distributed systems increases over time, there will be a need to endow such systems with the ability to operate in disaster scenarios. What makes this problem very complex is the heterogeneous nature of today’s distributed computing environments, which can be made up of hundreds or thousands of components (computers, databases, etc.). In addition, a user in one location might not have control over other parts of the system. There is therefore a need for “smart” algorithms (protocols) that can achieve an acceptable level of fault tolerance and account for a variety of disaster recovery scenarios.

Want to find out more?

Interested in this opportunity? Want to know what to do next? Find out all you need to know about the application process including how to approach a potential supervisor via email and how to develop a research proposal.
Browse for other opportunities within Computer Science.
Contact us to find out what's involved in applying for a PhD. Domestic students and international students.

Opportunity ID

The opportunity ID for this research opportunity is 978.