Early Internet & RPC: Difference between revisions
No edit summary |
m →Pros: - fix remove -> remote |
||
(4 intermediate revisions by one other user not shown) | |||
Line 1: | Line 1: | ||
==Readings== | ==Readings== | ||
kahn1972-resource.pdf | *'''[http://www.scs.carleton.ca/~soma/distos/2008-01-14/kahn1972-resource.pdf Robert E. Kahn, "Resource-Sharing Computer Communications Networks" (1972)]:''' | ||
This is one of the early papers describing the rationale behind the | |||
ARPANET (which later evolved into the Internet). For more | |||
background on the ARPANET, see [http://video.google.com/videoplay?docid=4989933629762859961 Computer Networks - The Heralds of Resource Sharing] (optional). | |||
Note how both the ARPANET and and standard operating systems were | |||
developed to facilitate resource sharing. | |||
*'''[http://www.scs.carleton.ca/~soma/distos/2008-01-14/nelson1981-rpc.pdf Bruce J. Nelson, ''Remote Procedure Call'' (1981)]:''' | |||
You only need to read the thesis summary which starts at page 224 in | |||
the PDF. If you have time, however, I'd suggest looking at the | |||
rest, particularly the introduction and related work. | |||
birrell1984-rpcimpl.pdf | *'''[http://www.scs.carleton.ca/~soma/distos/2008-01-14/birrell1984-rpcimpl.pdf Birrell & Nelson, "Implementing Remote Procedure Calls" (1984)]''' | ||
Compare the perspective of this RPC implementation description with | |||
the more design-oriented focus of Nelson's thesis. | |||
==Questions to be discussed== | ==Questions to be discussed== | ||
# What did the (technical) world look like when this paper was published? | |||
# What is the paper about? What are the key ideas? | |||
# What is the basic argument of the paper, and what sort of evidence is used to make the argument? | |||
# To what extent do you "believe" the argument? Why? | |||
# What did the authors get right about the future? What did they miss about the future? | |||
# What has been forgotten since this paper? | |||
==Presentations== | ==Presentations== | ||
== | ==Debate== | ||
Debate about the pros and cons of RPC. Topic: RPCs are the right foundation for the distributed applications (on today's Internet). | |||
===Pros=== | |||
# Easy to use | |||
#* makes remote procedure calls look like local calls | |||
#* local is easy | |||
#* therefore remote is easy with RPC - good! | |||
# (Counter to first con) Use standards, problem goes away | |||
# Easy to debug | |||
# Avoids complexity of protocol design | |||
# RPC can be abstracted from the protocol design | |||
# (Counter to fourth con) Use threads for interactivity. Manage complexity by doing things pairwise. | |||
# (Counter to fifth con) - wrong! | |||
# (Counter to seventh con) - Use standards (ORBs in CORBA, etc.) for authentication | |||
# (Counter to eighth con) - But you specify the API | |||
# (Counter to ninth con) - It's the same with other methods. But RPC makes it easier to do. | |||
===Cons=== | |||
# Language/implementation specific | |||
# (Counter to third pro) Not easy to debug. example: NullPointerException thrown on server, containing line of erroneous code running on the server. | |||
# Counter to fifth pro) But there is more overhead - slower | |||
# Synchronous - wrong model for large, distributed applications, and also bad for users | |||
# Limited scalability - hard to do well | |||
# Limited control of communication details | |||
# Authentication is opaque | |||
# Larger attack surface area | |||
# Poor match to listening (hard for server to communicate information back to client) | |||
# Backwards compatibility is difficult | |||
# RPC is not a "natural" interaction style - message passing is more similar to human communication | |||
# Too much to set up for simple applications | |||
# Less flexible | |||
===Anil's Comments=== | |||
RPC makes the simple things very easy, but it doesn't help with the big challenges of distributed programming: robustness and security. If those factors are of minimal importance (e.g. you are programming an isolated cluster running a very high-performance application), then this trade-off doesn't matter. However, there exist other distributed programming paradigms that are easy to use but that still distinguish between accessing internal functions and communicating with external entities. | |||
If we can keep the developer aware of when communication occurs, then there is a chance that developer will remember to do the input validation and error checking at those points, which is what is needed to make robust and secure applications. |
Latest revision as of 00:33, 16 April 2008
Readings
This is one of the early papers describing the rationale behind the ARPANET (which later evolved into the Internet). For more background on the ARPANET, see Computer Networks - The Heralds of Resource Sharing (optional).
Note how both the ARPANET and and standard operating systems were developed to facilitate resource sharing.
You only need to read the thesis summary which starts at page 224 in the PDF. If you have time, however, I'd suggest looking at the rest, particularly the introduction and related work.
Compare the perspective of this RPC implementation description with the more design-oriented focus of Nelson's thesis.
Questions to be discussed
- What did the (technical) world look like when this paper was published?
- What is the paper about? What are the key ideas?
- What is the basic argument of the paper, and what sort of evidence is used to make the argument?
- To what extent do you "believe" the argument? Why?
- What did the authors get right about the future? What did they miss about the future?
- What has been forgotten since this paper?
Presentations
Debate
Debate about the pros and cons of RPC. Topic: RPCs are the right foundation for the distributed applications (on today's Internet).
Pros
- Easy to use
- makes remote procedure calls look like local calls
- local is easy
- therefore remote is easy with RPC - good!
- (Counter to first con) Use standards, problem goes away
- Easy to debug
- Avoids complexity of protocol design
- RPC can be abstracted from the protocol design
- (Counter to fourth con) Use threads for interactivity. Manage complexity by doing things pairwise.
- (Counter to fifth con) - wrong!
- (Counter to seventh con) - Use standards (ORBs in CORBA, etc.) for authentication
- (Counter to eighth con) - But you specify the API
- (Counter to ninth con) - It's the same with other methods. But RPC makes it easier to do.
Cons
- Language/implementation specific
- (Counter to third pro) Not easy to debug. example: NullPointerException thrown on server, containing line of erroneous code running on the server.
- Counter to fifth pro) But there is more overhead - slower
- Synchronous - wrong model for large, distributed applications, and also bad for users
- Limited scalability - hard to do well
- Limited control of communication details
- Authentication is opaque
- Larger attack surface area
- Poor match to listening (hard for server to communicate information back to client)
- Backwards compatibility is difficult
- RPC is not a "natural" interaction style - message passing is more similar to human communication
- Too much to set up for simple applications
- Less flexible
Anil's Comments
RPC makes the simple things very easy, but it doesn't help with the big challenges of distributed programming: robustness and security. If those factors are of minimal importance (e.g. you are programming an isolated cluster running a very high-performance application), then this trade-off doesn't matter. However, there exist other distributed programming paradigms that are easy to use but that still distinguish between accessing internal functions and communicating with external entities.
If we can keep the developer aware of when communication occurs, then there is a chance that developer will remember to do the input validation and error checking at those points, which is what is needed to make robust and secure applications.