<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://homeostasis.scs.carleton.ca/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Alp</id>
	<title>Soma-notes - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://homeostasis.scs.carleton.ca/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Alp"/>
	<link rel="alternate" type="text/html" href="https://homeostasis.scs.carleton.ca/wiki/index.php/Special:Contributions/Alp"/>
	<updated>2026-04-24T08:23:47Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.42.1</generator>
	<entry>
		<id>https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_1&amp;diff=18907</id>
		<title>DistOS 2014W Lecture 1</title>
		<link rel="alternate" type="text/html" href="https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_1&amp;diff=18907"/>
		<updated>2014-03-25T14:40:36Z</updated>

		<summary type="html">&lt;p&gt;Alp: Typo.&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== What is an OS? ==&lt;br /&gt;
&lt;br /&gt;
An OS allows you to run on (slightly) different hardware. Here are some ideas of what and OS could mean and some functionalities and responsibilities that OSes include:&lt;br /&gt;
* A hardware abstraction such that hardware resources can be accessed by software&lt;br /&gt;
* Provides consistent execution environment, which hardware doesn&#039;t provide (ie. code written to interface &amp;amp;emdash; think portable code)&lt;br /&gt;
* Manages I/O (such as user I/O, machine I/O i.e. network I/O, sensors, videos, etc.)&lt;br /&gt;
* Resource management through multiplexing and policy use&lt;br /&gt;
** Multiplexing (sharing): one resource wanted by multiple users&lt;br /&gt;
* Communication infrastructure (example inter process communication mechanisms) between the users (process, applications) of the operating system.&lt;br /&gt;
* OS turns a computer you want to a computer you want to program&lt;br /&gt;
* Manages synchronization and concurrency issues&lt;br /&gt;
&lt;br /&gt;
An OS can be defined by the role it plays in the programming of systems. It takes care of resource management and creates abstraction. An OS turns hardware into the computer/API/interface &#039;&#039;&#039;want&#039;&#039;&#039; to program.&lt;br /&gt;
&lt;br /&gt;
This is similar to how the browser is becoming the OS of the web. The browser is&lt;br /&gt;
the key abstraction needed to run web apps. It is the interface web developers target.&lt;br /&gt;
It doesn&#039;t matter what you consume a given website on (eg. a phone, tablet,&lt;br /&gt;
etc.), the browser abstracts the device&#039;s hardware and OS away.&lt;br /&gt;
&lt;br /&gt;
== What is a distributed OS? ==&lt;br /&gt;
&lt;br /&gt;
Anil prefers to think of this &#039;logically&#039; than functionally/physically.  This is&lt;br /&gt;
because the old distributed operating system (DOS) model applies to today&#039;s systems&lt;br /&gt;
(ie. managing multiple cores, etc). The traditional definition is systems that&lt;br /&gt;
manage their resources over a Network.&lt;br /&gt;
&lt;br /&gt;
A lot of these definitions are hard to peg down because simplicity always gets in&lt;br /&gt;
the way of truth. These concepts to do not fit into well defined classes.&lt;br /&gt;
&lt;br /&gt;
To draw parallels to our previous definition of operating systems, a distributed OS takes the distributed pieces of a system, and turn it into the system you want.&lt;br /&gt;
&lt;br /&gt;
It is good to think about about DOSes within the context of who/what is in&lt;br /&gt;
control, in terms of who makes and enforces decisions in DOSes. In essence, who is in charge? The traditional kernel-process model is a dictatorship, an authoritarian&lt;br /&gt;
model of control. The kernel controls what lives or dies.  The internet, incontrast, is decentralised (eg. DNS&amp;amp;emdash;to some extent, ignoring centralized roots). Distributed systems may have distributed&lt;br /&gt;
policies where there is not one source of power. Even in DOS we see instances of authoritarian/centralized approaches one example being the walled garden model employed by Apple&#039;s iOS. Anil&#039;s observation is that centralized systems has an inherent fragility built into and these kind of systems come into existence and disappear after a while. Examples being AOL, Myspace. Even the Facebook also looks to be a possible candidate for a similar fate. Also, concentrations on policy will tend to fall apart in the future.&lt;/div&gt;</summary>
		<author><name>Alp</name></author>
	</entry>
	<entry>
		<id>https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_1&amp;diff=18906</id>
		<title>DistOS 2014W Lecture 1</title>
		<link rel="alternate" type="text/html" href="https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_1&amp;diff=18906"/>
		<updated>2014-03-25T14:39:47Z</updated>

		<summary type="html">&lt;p&gt;Alp: Trivial English semantic fixes.&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== What is an OS? ==&lt;br /&gt;
&lt;br /&gt;
An OS allows you to run on (slightly) different hardware. Here are some ideas of what and OS could mean and some functionalities and responsibilities that OSes include:&lt;br /&gt;
* A hardware abstraction such that hardware resources can be accessed by software&lt;br /&gt;
* Provides consistent execution environment, which hardware doesn&#039;t provide (ie. code written to interface &amp;amp;emdash; think portable code)&lt;br /&gt;
* Manages I/O (such as user I/O, machine I/O i.e. network I/O, sensors, videos, etc.)&lt;br /&gt;
* Resource management through multiplexing and policy use&lt;br /&gt;
** Multiplexing (sharing): one resource wanted by multiple users&lt;br /&gt;
* Communication infrastructure (example inter process communication mechanisms) between the users (process, applications) of the operating system.&lt;br /&gt;
* OS turns a computer you want to a computer you want to program&lt;br /&gt;
* Manages synchronization and concurrency issues&lt;br /&gt;
&lt;br /&gt;
An OS can be defined by the role it plays in the programming of systems. It takes care of resource management and creates abstraction. An OS turns hardware into the computer/API/interface &amp;quot;&amp;quot;&amp;quot;want&amp;quot;&amp;quot;&amp;quot; to program.&lt;br /&gt;
&lt;br /&gt;
This is similar to how the browser is becoming the OS of the web. The browser is&lt;br /&gt;
the key abstraction needed to run web apps. It is the interface web developers target.&lt;br /&gt;
It doesn&#039;t matter what you consume a given website on (eg. a phone, tablet,&lt;br /&gt;
etc.), the browser abstracts the device&#039;s hardware and OS away.&lt;br /&gt;
&lt;br /&gt;
== What is a distributed OS? ==&lt;br /&gt;
&lt;br /&gt;
Anil prefers to think of this &#039;logically&#039; than functionally/physically.  This is&lt;br /&gt;
because the old distributed operating system (DOS) model applies to today&#039;s systems&lt;br /&gt;
(ie. managing multiple cores, etc). The traditional definition is systems that&lt;br /&gt;
manage their resources over a Network.&lt;br /&gt;
&lt;br /&gt;
A lot of these definitions are hard to peg down because simplicity always gets in&lt;br /&gt;
the way of truth. These concepts to do not fit into well defined classes.&lt;br /&gt;
&lt;br /&gt;
To draw parallels to our previous definition of operating systems, a distributed OS takes the distributed pieces of a system, and turn it into the system you want.&lt;br /&gt;
&lt;br /&gt;
It is good to think about about DOSes within the context of who/what is in&lt;br /&gt;
control, in terms of who makes and enforces decisions in DOSes. In essence, who is in charge? The traditional kernel-process model is a dictatorship, an authoritarian&lt;br /&gt;
model of control. The kernel controls what lives or dies.  The internet, incontrast, is decentralised (eg. DNS&amp;amp;emdash;to some extent, ignoring centralized roots). Distributed systems may have distributed&lt;br /&gt;
policies where there is not one source of power. Even in DOS we see instances of authoritarian/centralized approaches one example being the walled garden model employed by Apple&#039;s iOS. Anil&#039;s observation is that centralized systems has an inherent fragility built into and these kind of systems come into existence and disappear after a while. Examples being AOL, Myspace. Even the Facebook also looks to be a possible candidate for a similar fate. Also, concentrations on policy will tend to fall apart in the future.&lt;/div&gt;</summary>
		<author><name>Alp</name></author>
	</entry>
	<entry>
		<id>https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_1&amp;diff=18905</id>
		<title>DistOS 2014W Lecture 1</title>
		<link rel="alternate" type="text/html" href="https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_1&amp;diff=18905"/>
		<updated>2014-03-25T14:33:29Z</updated>

		<summary type="html">&lt;p&gt;Alp: Trivial English syntactic fixes.&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== What is an OS? ==&lt;br /&gt;
&lt;br /&gt;
An OS allows you to run on (slightly) different hardware. Here are some ideas of what and OS could mean and some functionalities and responsibilities that OSes include:&lt;br /&gt;
* A hardware abstraction such that hardware resources can be accessed by software&lt;br /&gt;
* Provides consistent execution environment, which hardware doesn&#039;t provide (ie. code written to interface &amp;amp;emdash; think portable code)&lt;br /&gt;
* Manages I/O (such as user I/O, machine I/O i.e. network I/O, sensors, videos, etc.)&lt;br /&gt;
* Resource management through multiplexing and policy use&lt;br /&gt;
** Multiplexing (sharing): one resource wanted by multiple users&lt;br /&gt;
* Communication infrastructure (example inter process communication mechanisms) between the users (process, applications) of the operating system.&lt;br /&gt;
* OS turns a computer you want to a computer you want to program&lt;br /&gt;
* Manages synchronization and concurrency issues&lt;br /&gt;
&lt;br /&gt;
An OS can be defined by the role it plays in the programming of systems. It takes care of resource management and creates abstraction. An OS turns hardware into the computer/API/interface &amp;quot;&amp;quot;&amp;quot;want&amp;quot;&amp;quot;&amp;quot; to program.&lt;br /&gt;
&lt;br /&gt;
This is similar to how the browser is becoming the OS of the web. The browser is&lt;br /&gt;
the key abstraction needed to run web apps. It is the interface web developers target.&lt;br /&gt;
It doesn&#039;t matter what you consume a given website on (eg. a phone, tablet,&lt;br /&gt;
etc.), the browser abstracts the device&#039;s hardware and OS away.&lt;br /&gt;
&lt;br /&gt;
== What is a distributed OS? ==&lt;br /&gt;
&lt;br /&gt;
Anil prefers to think of this &#039;logically&#039; than functionally/physically.  This is&lt;br /&gt;
because the old distributed operating system (DOS) model applies to today&#039;s systems&lt;br /&gt;
(ie. managing multiple cores, etc). The traditional definition is systems that&lt;br /&gt;
manage their resources over a Network.&lt;br /&gt;
&lt;br /&gt;
A lot of these definitions are hard to peg down because simplicity always gets in&lt;br /&gt;
the way of truth. These concepts to do not fit into well defined classes.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Anil&#039;s definition&#039;&#039;&#039; (following up on the similar note on what a traditional OS is, as seen above): &amp;quot;taking the distributed pieces of a system you have and&lt;br /&gt;
turning it into the system you WANT.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
It is good to think about about DOSes within the context of who/what is in&lt;br /&gt;
control, in terms of who makes and enforces decisions in DOSes. In essence, who is in charge? The traditional kernel-process model is a dictatorship. Authoritarian&lt;br /&gt;
model of control. The kernel controls what lives or dies.  The internet, by&lt;br /&gt;
contrast, is decentralised (eg. DNS). Distributed systems may have distributed&lt;br /&gt;
policies where there is not one source of power. Even in DOS paradigm we can see instances of authoritarian/centralized approaches one example being the walled garden model employed by Apple&#039;s iOS. Anil&#039;s observation is that centralized systems has an inherent fragility built into and these kind of systems come into existence and disappear after a while. Examples being AOL, Myspace. Even the Facebook also looks to be a possible candidate for a similar fate. Also, concentrations on policy will tend to fall apart in the future.&lt;/div&gt;</summary>
		<author><name>Alp</name></author>
	</entry>
	<entry>
		<id>https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_1&amp;diff=18904</id>
		<title>DistOS 2014W Lecture 1</title>
		<link rel="alternate" type="text/html" href="https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_1&amp;diff=18904"/>
		<updated>2014-03-25T14:14:14Z</updated>

		<summary type="html">&lt;p&gt;Alp: Semantic markup fixup.&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== What is an OS? ==&lt;br /&gt;
&lt;br /&gt;
An OS allows you to run on (slightly) different hardware. Here are some ideas of what and OS could mean and some functionalities and responsibilities that OSes include::&lt;br /&gt;
* A hardware abstraction such that hardware resources can be accessed by software&lt;br /&gt;
* Provides consistent execution environment, which hardware doesn&#039;t provide (ie. code written to interface -- think portable code)&lt;br /&gt;
* Manages I/O (such as user I/O, machine I/O i.e. network I/O, sensors, videos, etc.)&lt;br /&gt;
* Resource management through multiplexing and policy use&lt;br /&gt;
** Multiplexing (sharing): one resource wanted by multiple users&lt;br /&gt;
* Communication infrastructure (example Inter Process Communication mechanisms) between the users (process, applications) of the Operating System.&lt;br /&gt;
* OS turns a computer you want to a computer you want to program&lt;br /&gt;
* Manages synchronization and concurrency issues&lt;br /&gt;
&lt;br /&gt;
An OS can be defined by the role it plays in the programming of systems. It takes care of resource management and creates abstraction. An OS turns hardware into the computer/api/interface you WANT to program.&lt;br /&gt;
&lt;br /&gt;
This is similar to how the browser is becoming the OS of the web. The browser is&lt;br /&gt;
the key abstraction needed to run web apps. It is the interface web developers target.&lt;br /&gt;
It doesn&#039;t matter what you consume a given website on (eg. a phone, tablet,&lt;br /&gt;
etc.), the browser abstracts the device&#039;s hardware and OS away.&lt;br /&gt;
&lt;br /&gt;
== What is a distributed OS? ==&lt;br /&gt;
&lt;br /&gt;
Anil prefers to think of this &#039;logically&#039; than functionally/physically.  This is&lt;br /&gt;
because the old distributed operating system (DOS) model applies to today&#039;s systems&lt;br /&gt;
(ie. managing multiple cores, etc). The traditional definition is systems that&lt;br /&gt;
manage their resources over a Network.&lt;br /&gt;
&lt;br /&gt;
A lot of these definitions are hard to peg down because simplicity always gets in&lt;br /&gt;
the way of truth. These concepts to do not fit into well defined classes.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Anil&#039;s definition&#039;&#039;&#039; (following up on the similar note on what a traditional OS is, as seen above): &amp;quot;taking the distributed pieces of a system you have and&lt;br /&gt;
turning it into the system you WANT.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
It is good to think about about DOS&#039;s within the context of who/what is in&lt;br /&gt;
control, in terms of who makes and enforces decisions in DOS. In essence, who is in charge? The traditional kernel-process model is a dictatorship. Authoritarian&lt;br /&gt;
model of control. The kernel controls what lives or dies.  The internet, by&lt;br /&gt;
contrast, is decentralised (eg. DNS). Distributed systems may have distributed&lt;br /&gt;
policies where there is not one source of power. Even in DOS paradigm we can see instances of authoritarian/centralized approaches one example being the walled garden model employed by Apple iOS. Anil&#039;s observation is that centralized systems has an inherent fragility built into and these kind of systems come into existence and disappear after a while. Examples being AOL, Myspace. Even the Facebook also looks to be a possible candidate for a similar fate. Also, concentrations on policy will tend to fall apart in the future.&lt;/div&gt;</summary>
		<author><name>Alp</name></author>
	</entry>
	<entry>
		<id>https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_10&amp;diff=18581</id>
		<title>DistOS 2014W Lecture 10</title>
		<link rel="alternate" type="text/html" href="https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_10&amp;diff=18581"/>
		<updated>2014-02-06T16:30:04Z</updated>

		<summary type="html">&lt;p&gt;Alp: Raw note dump&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Context==&lt;br /&gt;
&lt;br /&gt;
== GFS ==&lt;br /&gt;
&lt;br /&gt;
* Very different because of the workload that it is desgined for.&lt;br /&gt;
** Because of the number of small files that have to be indexed for the web, etc., it is no longer practical to have a filesystem that stores these individually. Too much overhead. Punts problem to userspace, incl. record delimitation.&lt;br /&gt;
* Don&#039;t care about latency, surprising considering it&#039;s Google, the guys who change the TCP IW standard recommendations for latency.&lt;br /&gt;
* Mostly seeking through entire file.&lt;br /&gt;
* Paper from 2003, mentions still using 100BASE-T links.&lt;br /&gt;
* Data-heavy, metadata light. Contacting the metadata server is a rare event.&lt;br /&gt;
* Really good that they designed for unreliable hardware:&lt;br /&gt;
** All the replication&lt;br /&gt;
** Data checksumming&lt;br /&gt;
* Performance degrades for small random access workload; use other filesystem.&lt;br /&gt;
* Path of least resistance to scale, not to do something super CS-smart.&lt;br /&gt;
* Google used to re-index every month, swapping out indexes. Now, it&#039;s much more online. GFS is now just a layer to support a more dynamic layer.&lt;br /&gt;
&lt;br /&gt;
=== Segue on drives ===&lt;br /&gt;
&lt;br /&gt;
* Structure of GFS does match some other modern systems:&lt;br /&gt;
** Hard drives are like parallel tapes, very suited for streaming.&lt;br /&gt;
** Flash devices are log-structured too, but have an abstracting firmware. You want to do erasure in bulk, in the &#039;&#039;&#039;background&#039;&#039;&#039;. Used to be we needed specialized FS for MTDs to get better performance; though now we have better microcontrollers in some embedded systems to abstract away the hardware.&lt;br /&gt;
* Architectures that start big, often end up in the smallest things.&lt;br /&gt;
&lt;br /&gt;
== How other filesystems compare to GFS and Ceph ==&lt;br /&gt;
&lt;br /&gt;
* Data and metadata are held together.&lt;br /&gt;
** Doesn&#039;t account for different access patterns:&lt;br /&gt;
*** Data → big, long transfers&lt;br /&gt;
*** Metadata → small, low latency&lt;br /&gt;
** Can&#039;t scale separately&lt;br /&gt;
* By design, a file is a fraction of the size of a server&lt;br /&gt;
** Huge files spread over many servers not even in the cards for NFS&lt;br /&gt;
** Meant for small problems, not web-scale&lt;br /&gt;
*** Google has a copy of the publicly accessible internet&lt;br /&gt;
**** Their strategy is to copy the internet to index it&lt;br /&gt;
**** Insane → insane filesystem&lt;br /&gt;
* Designed for lower latency&lt;br /&gt;
* Designed for POSIX semantics; how the requirements that lead to the ‘standard’ evolved&lt;br /&gt;
* Even mainframes, scale-up solutions, ultra-reliable systems, with data sets bigger than RAM don&#039;t have this scale.&lt;br /&gt;
* Reliability was a property of the host, not the network&lt;br /&gt;
* Point-to-point access; much less load-balancing, even in AFS&lt;br /&gt;
** Single point of entry, single point of failure, bottleneck&lt;br /&gt;
&lt;br /&gt;
==Ceph==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Ceph is crazy and tries to do everything&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Unlike GFS, distributes metadata, not just for read-only copies&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Unlike GFS, the OSDs have some intelligence, and autonomously distribute the data, rather than being controlled by a master.&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Uses hashing in the distribution process to &#039;&#039;&#039;uniformly&#039;&#039;&#039; distribute data&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;&amp;lt;p&amp;gt;The actual algorithm for distributing data is as follows:&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;p&amp;gt;&amp;lt;math&amp;gt;file + offset → hash(object ID) → CRUSH(placement group) → OSD&amp;lt;/math&amp;gt;&amp;lt;/p&amp;gt;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Each client has knowledge of the entire storage network.&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Tracks failure groups (same breaker, switch, etc.), hot data, etc.&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Number of replicas is changeable on the fly, but the placement group is not&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;For example, if every client on the planet is accessing the same file, you can scale out for that data.&amp;lt;/li&amp;gt;&amp;lt;/ul&amp;gt;&lt;br /&gt;
&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;You don&#039;t ask where to go, you just go, which makes this very scalable&amp;lt;/li&amp;gt;&amp;lt;/ul&amp;gt;&lt;br /&gt;
&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;CRUSH is sufficiently advanced to be called magic.&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;&amp;lt;math&amp;gt;O(log n)&amp;lt;/math&amp;gt; of the size of the data&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;CPUs stupidly fast, so the above is of minimal overhead, whereas the network, despite being fast, has latency, etc. Computation scales much better than communication.&amp;lt;/li&amp;gt;&amp;lt;/ul&amp;gt;&lt;br /&gt;
&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Storage is composed of variable-length atoms&amp;lt;/li&amp;gt;&amp;lt;/ul&amp;gt;&lt;/div&gt;</summary>
		<author><name>Alp</name></author>
	</entry>
	<entry>
		<id>https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_6&amp;diff=18488</id>
		<title>DistOS 2014W Lecture 6</title>
		<link rel="alternate" type="text/html" href="https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_6&amp;diff=18488"/>
		<updated>2014-01-24T03:09:53Z</updated>

		<summary type="html">&lt;p&gt;Alp: Make header a second-level, as per MediaWiki style guidelines.&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
&lt;br /&gt;
== Group Discussion on &amp;quot;The Early Web&amp;quot; ==&lt;br /&gt;
&lt;br /&gt;
Questions to discuss:&lt;br /&gt;
&lt;br /&gt;
# How do you think the web would have been if not like the present way? &lt;br /&gt;
# What kind of infrastructure changes would you like to make? &lt;br /&gt;
&lt;br /&gt;
=== Group 1 ===&lt;br /&gt;
: Relatively satisfied with the present structure of the web some changes suggested are in the below areas: &lt;br /&gt;
* Make use of the greater potential of Protocols &lt;br /&gt;
* More communication and interaction capabilities.&lt;br /&gt;
* Implementation changes in the present payment method systems. Example usage of &amp;quot;Micro-computation&amp;quot; - a discussion we would get back to in future classes. Also, Cryptographic currencies.&lt;br /&gt;
* Augmented reality.&lt;br /&gt;
* More towards individual privacy. &lt;br /&gt;
&lt;br /&gt;
=== Group 2 ===&lt;br /&gt;
==== Problem of unstructured information ====&lt;br /&gt;
A large portion of the web serves content that is overwhelmingly concerned about presentation rather than structuring content. Tim Berner-Lees himself bemoaned the death of the semantic web. His original vision of it was as follows:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- Code from Wikipedia&#039;s article on the semantic web, except for the block quoting form, which this MediaWiki instance doens&#039;t seem to support. --&amp;gt;&lt;br /&gt;
&amp;lt;blockquote&amp;gt;I have a dream for the Web [in which computers] become capable of analyzing all the data on the Web&amp;amp;nbsp;– the content, links, and transactions between people and computers. A &amp;quot;Semantic Web&amp;quot;, which makes this possible, has yet to emerge, but when it does, the day-to-day mechanisms of trade, bureaucracy and our daily lives will be handled by machines talking to machines. The &amp;quot;intelligent agents&amp;quot; people have touted for ages will finally materialize.&amp;lt;ref&amp;gt;{{cite book |last=Berners-Lee |first=Tim |authorlink=Tim Berners-Lee |coauthors=Fischetti, Mark |title=Weaving the Web |publisher=HarperSanFrancisco |year=1999 |pages=chapter 12 |isbn=978-0-06-251587-2 |nopp=true }}&amp;lt;/ref&amp;gt;&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For this vision to be true, information arguably needs to be structured, maybe even classified. The idea of a universal information classification system has been floated. The modern web is mostly developed by software developers and similar, not librarians and the like.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- TODO: Yahoo blurb. --&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Also, how does one differentiate satire from fact?&lt;br /&gt;
&lt;br /&gt;
==== Valuation and deduplication of information ====&lt;br /&gt;
Another problem common with the current wwww is the duplication of information. Redundancy is not in itself harmful to increase the availability of information, but is ad-hoc duplication of the information itself?&lt;br /&gt;
&lt;br /&gt;
One then comes to the problem of assigning a value to the information found therein. How does one rate information, and according to what criteria? How does one authenticate the information? Often, popularity is used as an indicator of veracity, almost in a sophistic manner. See excessive reliance on Google page ranking or Reddit score for various types of information consumption for research or news consumption respectively.&lt;br /&gt;
&lt;br /&gt;
=== On the current infrastructure ===&lt;br /&gt;
The current &amp;lt;em&amp;gt;internet&amp;lt;/em&amp;gt; infrastructure should remain as is, at least in countries with not just a modicum of freedom of access to information. Centralization of of control of access to information is a terrible power. See China and parts of the Middle-East. On that note, what can be said of popular sites, such as Google or Wikipedia that serve as the main entry point for many access patterns?&lt;br /&gt;
&lt;br /&gt;
The problem, if any, in the current web infrastructure is of the web itself, not the internet.&lt;br /&gt;
&lt;br /&gt;
=== Group 3 ===&lt;br /&gt;
* What we want to keep &lt;br /&gt;
** Linking mechanisms&lt;br /&gt;
** Minimum permissions to publish&lt;br /&gt;
* What we don&#039;t like&lt;br /&gt;
** Relying on one source for document &lt;br /&gt;
** Privacy links for security&lt;br /&gt;
* Proposal &lt;br /&gt;
** Peer-peer to distributed mechanisms for documenting&lt;br /&gt;
** Reverse links with caching - distributed cache&lt;br /&gt;
** More availability for user - what happens when system fails? &lt;br /&gt;
** Key management to be considered - Is it good to have centralized or distributed mechanism? &lt;br /&gt;
&lt;br /&gt;
=== Group 4 ===&lt;br /&gt;
* An idea of web searching for us &lt;br /&gt;
* A suggestion of a different web if it would have been implemented by &amp;quot;AI&amp;quot; people&lt;br /&gt;
** AI programs searching for data - A notion already being implemented by Google slowly.&lt;br /&gt;
* Generate report forums&lt;br /&gt;
* HTML equivalent is inspired by the AI communication&lt;br /&gt;
* Higher semantics apart from just indexing the data&lt;br /&gt;
** Problem : &amp;quot;How to bridge the semantic gap?&amp;quot;&lt;br /&gt;
** Search for more data patterns&lt;br /&gt;
&lt;br /&gt;
== Group design exercise — The web that could be ==&lt;br /&gt;
&lt;br /&gt;
* “The web that wasn&#039;t” mentioned the moans of librarians.&lt;br /&gt;
* A universal classification system is needed.&lt;br /&gt;
* The training overhead of classifiers (e.g., librarians) is high. See the master&#039;s that a librarian would need.&lt;br /&gt;
* More structured content, both classification, and organization&lt;br /&gt;
* Current indexing by crude brute-force searching for words, etc., rather than searching metadata&lt;br /&gt;
* Information doesn&#039;t have the same persistence, see bitrot and Vint Cerf&#039;s talk.&lt;br /&gt;
* Too concerned with presentation now.&lt;br /&gt;
* Tim Berner-Lees bemoaning the death of the semantic web.&lt;br /&gt;
* The problem of information duplication when information gets redistributed across the web. However, we do want redundancy.&lt;br /&gt;
* Too much developed by software developers&lt;br /&gt;
* Too reliant on Google for web structure&lt;br /&gt;
** See search-engine optimization&lt;br /&gt;
* Problem of authentication (of the information, not the presenter)&lt;br /&gt;
** Too dependent at times on the popularity of a site, almost in a sophistic manner.&lt;br /&gt;
** See Reddit&lt;br /&gt;
* How do you programmatically distinguish satire from fact&lt;br /&gt;
* The web&#039;s structure is also “shaped by inbound links but would be nice a bit more”&lt;br /&gt;
* Infrastructure doesn&#039;t need to change per se.&lt;br /&gt;
** The distributed architecture should still stay. Centralization of control of allowed information and access is terrible power. See China and the Middle-East.&lt;br /&gt;
** Information, for the most part, in itself, exists centrally (as per-page), though communities (to use a generic term) are distributed.&lt;br /&gt;
* Need more sophisticated natural language processing.&lt;br /&gt;
&lt;br /&gt;
== Class discussion ==&lt;br /&gt;
&lt;br /&gt;
Focusing on vision, not the mechanism.&lt;br /&gt;
&lt;br /&gt;
* Reverse linking&lt;br /&gt;
* Distributed content distribution (glorified cache)&lt;br /&gt;
** Both for privacy and redunancy reasons&lt;br /&gt;
** Suggested centralized content certification, but doesn&#039;t address the problem of root of trust and distributed consistency checking.&lt;br /&gt;
*** Distributed key management is a holy grail&lt;br /&gt;
*** What about detecting large-scale subversion attempts, like in China&lt;br /&gt;
* What is the new revenue model?&lt;br /&gt;
** What was TBL&#039;s revenue model (tongue-in-cheek, none)?&lt;br /&gt;
** Organisations like Google monetized the internet, and this mechanism could destroy their ability to do so.&lt;br /&gt;
* Search work is semi-distributed. Suggested letting the web do the work for you.&lt;br /&gt;
* Trying to structure content in a manner simultaneously palatable to both humans and machines.&lt;br /&gt;
* Using spare CPU time on servers for natural language processing (or other AI) of cached or locally available resources.&lt;br /&gt;
* Imagine a smushed Wolfram Alpha, Google, Wikipedia, and Watson, and then distributed over the net.&lt;br /&gt;
* The document was TBL&#039;s idea of the atom of content, whereas nowaday we really need something more granular.&lt;br /&gt;
* We want to extract higher-level semantics.&lt;br /&gt;
* Google may not be pure keyword search anymore. It is essentially AI now, but we still struggle with expressing what we want to Google.&lt;br /&gt;
* What about the adversarial aspect of content hosters, vying for attention?&lt;br /&gt;
* People do actively try to fool you.&lt;br /&gt;
* Compare to Google News, though that is very specific to that domain. Their vision is a semantic web, but they are incrementally building it.&lt;br /&gt;
* In a scary fashion, Google is one of the central points of failure of the web. Even scarier is less technically competent people who depend on Facebook for that.&lt;br /&gt;
* There is a semantic gap between how we express and query information, and how AI understands it.&lt;br /&gt;
* Can think of Facebook as a distributed human search infrastructure.&lt;br /&gt;
* A core service of an operating system is locating information. &#039;&#039;&#039;Search is infrastructure.&#039;&#039;&#039;&lt;br /&gt;
* The problem is not purely technical. There are political and social aspects.&lt;br /&gt;
** Searching for a file on a local filesystem should have a unambiguous answer.&lt;br /&gt;
** Asking the web is a different thing. “What is the best chocolate bar?”&lt;br /&gt;
* Is the web a network database, as understood in COMP 3005, which we consider harmful.&lt;br /&gt;
* For two-way links, there is the problem of restructuring data and all the dependencies.&lt;br /&gt;
* Privacy issues when tracing paths across the web.&lt;br /&gt;
* What about the problem of information revocation?&lt;br /&gt;
* Need more augmented reality and distributed and micro payment systems.&lt;br /&gt;
* We need distributed, mutually untrusting social networks.&lt;br /&gt;
** Now we have the problem of storage and computation, but also take away some of of the monetizationable aspect.&lt;br /&gt;
* Distribution is not free. It is very expensive in very funny ways.&lt;br /&gt;
* The dream of harvesting all the computational power of the internet is not new.&lt;br /&gt;
** Startups have come and gone many times over that problem.&lt;br /&gt;
* Google&#039;s indexers understands quite well many documents on the web. However, it only &#039;&#039;&#039;presents&#039;&#039;&#039; a primitive keyword-like interface. It doesn&#039;t expose the ontology.&lt;br /&gt;
* Organising information does not necessarily mean applying an ontology to it.&lt;br /&gt;
* The organisational methods we now use don&#039;t use ontologies, but rather are supplemented by them.&lt;/div&gt;</summary>
		<author><name>Alp</name></author>
	</entry>
	<entry>
		<id>https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_6&amp;diff=18487</id>
		<title>DistOS 2014W Lecture 6</title>
		<link rel="alternate" type="text/html" href="https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_6&amp;diff=18487"/>
		<updated>2014-01-24T03:09:25Z</updated>

		<summary type="html">&lt;p&gt;Alp: Make header a second-level, as per MediaWiki style guidelines.&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
&lt;br /&gt;
== Group Discussion on &amp;quot;The Early Web&amp;quot; ==&lt;br /&gt;
&lt;br /&gt;
Questions to discuss:&lt;br /&gt;
&lt;br /&gt;
# How do you think the web would have been if not like the present way? &lt;br /&gt;
# What kind of infrastructure changes would you like to make? &lt;br /&gt;
&lt;br /&gt;
=== Group 1 ===&lt;br /&gt;
: Relatively satisfied with the present structure of the web some changes suggested are in the below areas: &lt;br /&gt;
* Make use of the greater potential of Protocols &lt;br /&gt;
* More communication and interaction capabilities.&lt;br /&gt;
* Implementation changes in the present payment method systems. Example usage of &amp;quot;Micro-computation&amp;quot; - a discussion we would get back to in future classes. Also, Cryptographic currencies.&lt;br /&gt;
* Augmented reality.&lt;br /&gt;
* More towards individual privacy. &lt;br /&gt;
&lt;br /&gt;
=== Group 2 ===&lt;br /&gt;
==== Problem of unstructured information ====&lt;br /&gt;
A large portion of the web serves content that is overwhelmingly concerned about presentation rather than structuring content. Tim Berner-Lees himself bemoaned the death of the semantic web. His original vision of it was as follows:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- Code from Wikipedia&#039;s article on the semantic web, except for the block quoting form, which this MediaWiki instance doens&#039;t seem to support. --&amp;gt;&lt;br /&gt;
&amp;lt;blockquote&amp;gt;I have a dream for the Web [in which computers] become capable of analyzing all the data on the Web&amp;amp;nbsp;– the content, links, and transactions between people and computers. A &amp;quot;Semantic Web&amp;quot;, which makes this possible, has yet to emerge, but when it does, the day-to-day mechanisms of trade, bureaucracy and our daily lives will be handled by machines talking to machines. The &amp;quot;intelligent agents&amp;quot; people have touted for ages will finally materialize.&amp;lt;ref&amp;gt;{{cite book |last=Berners-Lee |first=Tim |authorlink=Tim Berners-Lee |coauthors=Fischetti, Mark |title=Weaving the Web |publisher=HarperSanFrancisco |year=1999 |pages=chapter 12 |isbn=978-0-06-251587-2 |nopp=true }}&amp;lt;/ref&amp;gt;&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For this vision to be true, information arguably needs to be structured, maybe even classified. The idea of a universal information classification system has been floated. The modern web is mostly developed by software developers and similar, not librarians and the like.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- TODO: Yahoo blurb. --&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Also, how does one differentiate satire from fact?&lt;br /&gt;
&lt;br /&gt;
==== Valuation and deduplication of information ====&lt;br /&gt;
Another problem common with the current wwww is the duplication of information. Redundancy is not in itself harmful to increase the availability of information, but is ad-hoc duplication of the information itself?&lt;br /&gt;
&lt;br /&gt;
One then comes to the problem of assigning a value to the information found therein. How does one rate information, and according to what criteria? How does one authenticate the information? Often, popularity is used as an indicator of veracity, almost in a sophistic manner. See excessive reliance on Google page ranking or Reddit score for various types of information consumption for research or news consumption respectively.&lt;br /&gt;
&lt;br /&gt;
=== On the current infrastructure ===&lt;br /&gt;
The current &amp;lt;em&amp;gt;internet&amp;lt;/em&amp;gt; infrastructure should remain as is, at least in countries with not just a modicum of freedom of access to information. Centralization of of control of access to information is a terrible power. See China and parts of the Middle-East. On that note, what can be said of popular sites, such as Google or Wikipedia that serve as the main entry point for many access patterns?&lt;br /&gt;
&lt;br /&gt;
The problem, if any, in the current web infrastructure is of the web itself, not the internet.&lt;br /&gt;
&lt;br /&gt;
=== Group 3 ===&lt;br /&gt;
* What we want to keep &lt;br /&gt;
** Linking mechanisms&lt;br /&gt;
** Minimum permissions to publish&lt;br /&gt;
* What we don&#039;t like&lt;br /&gt;
** Relying on one source for document &lt;br /&gt;
** Privacy links for security&lt;br /&gt;
* Proposal &lt;br /&gt;
** Peer-peer to distributed mechanisms for documenting&lt;br /&gt;
** Reverse links with caching - distributed cache&lt;br /&gt;
** More availability for user - what happens when system fails? &lt;br /&gt;
** Key management to be considered - Is it good to have centralized or distributed mechanism? &lt;br /&gt;
&lt;br /&gt;
=== Group 4 ===&lt;br /&gt;
* An idea of web searching for us &lt;br /&gt;
* A suggestion of a different web if it would have been implemented by &amp;quot;AI&amp;quot; people&lt;br /&gt;
** AI programs searching for data - A notion already being implemented by Google slowly.&lt;br /&gt;
* Generate report forums&lt;br /&gt;
* HTML equivalent is inspired by the AI communication&lt;br /&gt;
* Higher semantics apart from just indexing the data&lt;br /&gt;
** Problem : &amp;quot;How to bridge the semantic gap?&amp;quot;&lt;br /&gt;
** Search for more data patterns&lt;br /&gt;
&lt;br /&gt;
= Group design exercise — The web that could be =&lt;br /&gt;
&lt;br /&gt;
* “The web that wasn&#039;t” mentioned the moans of librarians.&lt;br /&gt;
* A universal classification system is needed.&lt;br /&gt;
* The training overhead of classifiers (e.g., librarians) is high. See the master&#039;s that a librarian would need.&lt;br /&gt;
* More structured content, both classification, and organization&lt;br /&gt;
* Current indexing by crude brute-force searching for words, etc., rather than searching metadata&lt;br /&gt;
* Information doesn&#039;t have the same persistence, see bitrot and Vint Cerf&#039;s talk.&lt;br /&gt;
* Too concerned with presentation now.&lt;br /&gt;
* Tim Berner-Lees bemoaning the death of the semantic web.&lt;br /&gt;
* The problem of information duplication when information gets redistributed across the web. However, we do want redundancy.&lt;br /&gt;
* Too much developed by software developers&lt;br /&gt;
* Too reliant on Google for web structure&lt;br /&gt;
** See search-engine optimization&lt;br /&gt;
* Problem of authentication (of the information, not the presenter)&lt;br /&gt;
** Too dependent at times on the popularity of a site, almost in a sophistic manner.&lt;br /&gt;
** See Reddit&lt;br /&gt;
* How do you programmatically distinguish satire from fact&lt;br /&gt;
* The web&#039;s structure is also “shaped by inbound links but would be nice a bit more”&lt;br /&gt;
* Infrastructure doesn&#039;t need to change per se.&lt;br /&gt;
** The distributed architecture should still stay. Centralization of control of allowed information and access is terrible power. See China and the Middle-East.&lt;br /&gt;
** Information, for the most part, in itself, exists centrally (as per-page), though communities (to use a generic term) are distributed.&lt;br /&gt;
* Need more sophisticated natural language processing.&lt;br /&gt;
&lt;br /&gt;
== Class discussion ==&lt;br /&gt;
&lt;br /&gt;
Focusing on vision, not the mechanism.&lt;br /&gt;
&lt;br /&gt;
* Reverse linking&lt;br /&gt;
* Distributed content distribution (glorified cache)&lt;br /&gt;
** Both for privacy and redunancy reasons&lt;br /&gt;
** Suggested centralized content certification, but doesn&#039;t address the problem of root of trust and distributed consistency checking.&lt;br /&gt;
*** Distributed key management is a holy grail&lt;br /&gt;
*** What about detecting large-scale subversion attempts, like in China&lt;br /&gt;
* What is the new revenue model?&lt;br /&gt;
** What was TBL&#039;s revenue model (tongue-in-cheek, none)?&lt;br /&gt;
** Organisations like Google monetized the internet, and this mechanism could destroy their ability to do so.&lt;br /&gt;
* Search work is semi-distributed. Suggested letting the web do the work for you.&lt;br /&gt;
* Trying to structure content in a manner simultaneously palatable to both humans and machines.&lt;br /&gt;
* Using spare CPU time on servers for natural language processing (or other AI) of cached or locally available resources.&lt;br /&gt;
* Imagine a smushed Wolfram Alpha, Google, Wikipedia, and Watson, and then distributed over the net.&lt;br /&gt;
* The document was TBL&#039;s idea of the atom of content, whereas nowaday we really need something more granular.&lt;br /&gt;
* We want to extract higher-level semantics.&lt;br /&gt;
* Google may not be pure keyword search anymore. It is essentially AI now, but we still struggle with expressing what we want to Google.&lt;br /&gt;
* What about the adversarial aspect of content hosters, vying for attention?&lt;br /&gt;
* People do actively try to fool you.&lt;br /&gt;
* Compare to Google News, though that is very specific to that domain. Their vision is a semantic web, but they are incrementally building it.&lt;br /&gt;
* In a scary fashion, Google is one of the central points of failure of the web. Even scarier is less technically competent people who depend on Facebook for that.&lt;br /&gt;
* There is a semantic gap between how we express and query information, and how AI understands it.&lt;br /&gt;
* Can think of Facebook as a distributed human search infrastructure.&lt;br /&gt;
* A core service of an operating system is locating information. &#039;&#039;&#039;Search is infrastructure.&#039;&#039;&#039;&lt;br /&gt;
* The problem is not purely technical. There are political and social aspects.&lt;br /&gt;
** Searching for a file on a local filesystem should have a unambiguous answer.&lt;br /&gt;
** Asking the web is a different thing. “What is the best chocolate bar?”&lt;br /&gt;
* Is the web a network database, as understood in COMP 3005, which we consider harmful.&lt;br /&gt;
* For two-way links, there is the problem of restructuring data and all the dependencies.&lt;br /&gt;
* Privacy issues when tracing paths across the web.&lt;br /&gt;
* What about the problem of information revocation?&lt;br /&gt;
* Need more augmented reality and distributed and micro payment systems.&lt;br /&gt;
* We need distributed, mutually untrusting social networks.&lt;br /&gt;
** Now we have the problem of storage and computation, but also take away some of of the monetizationable aspect.&lt;br /&gt;
* Distribution is not free. It is very expensive in very funny ways.&lt;br /&gt;
* The dream of harvesting all the computational power of the internet is not new.&lt;br /&gt;
** Startups have come and gone many times over that problem.&lt;br /&gt;
* Google&#039;s indexers understands quite well many documents on the web. However, it only &#039;&#039;&#039;presents&#039;&#039;&#039; a primitive keyword-like interface. It doesn&#039;t expose the ontology.&lt;br /&gt;
* Organising information does not necessarily mean applying an ontology to it.&lt;br /&gt;
* The organisational methods we now use don&#039;t use ontologies, but rather are supplemented by them.&lt;/div&gt;</summary>
		<author><name>Alp</name></author>
	</entry>
	<entry>
		<id>https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_6&amp;diff=18486</id>
		<title>DistOS 2014W Lecture 6</title>
		<link rel="alternate" type="text/html" href="https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_6&amp;diff=18486"/>
		<updated>2014-01-24T03:08:19Z</updated>

		<summary type="html">&lt;p&gt;Alp: Add missing quotation mark in header&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
&lt;br /&gt;
== Group Discussion on &amp;quot;The Early Web&amp;quot; ==&lt;br /&gt;
&lt;br /&gt;
Questions to discuss:&lt;br /&gt;
&lt;br /&gt;
# How do you think the web would have been if not like the present way? &lt;br /&gt;
# What kind of infrastructure changes would you like to make? &lt;br /&gt;
&lt;br /&gt;
=== Group 1 ===&lt;br /&gt;
: Relatively satisfied with the present structure of the web some changes suggested are in the below areas: &lt;br /&gt;
* Make use of the greater potential of Protocols &lt;br /&gt;
* More communication and interaction capabilities.&lt;br /&gt;
* Implementation changes in the present payment method systems. Example usage of &amp;quot;Micro-computation&amp;quot; - a discussion we would get back to in future classes. Also, Cryptographic currencies.&lt;br /&gt;
* Augmented reality.&lt;br /&gt;
* More towards individual privacy. &lt;br /&gt;
&lt;br /&gt;
=== Group 2 ===&lt;br /&gt;
==== Problem of unstructured information ====&lt;br /&gt;
A large portion of the web serves content that is overwhelmingly concerned about presentation rather than structuring content. Tim Berner-Lees himself bemoaned the death of the semantic web. His original vision of it was as follows:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- Code from Wikipedia&#039;s article on the semantic web, except for the block quoting form, which this MediaWiki instance doens&#039;t seem to support. --&amp;gt;&lt;br /&gt;
&amp;lt;blockquote&amp;gt;I have a dream for the Web [in which computers] become capable of analyzing all the data on the Web&amp;amp;nbsp;– the content, links, and transactions between people and computers. A &amp;quot;Semantic Web&amp;quot;, which makes this possible, has yet to emerge, but when it does, the day-to-day mechanisms of trade, bureaucracy and our daily lives will be handled by machines talking to machines. The &amp;quot;intelligent agents&amp;quot; people have touted for ages will finally materialize.&amp;lt;ref&amp;gt;{{cite book |last=Berners-Lee |first=Tim |authorlink=Tim Berners-Lee |coauthors=Fischetti, Mark |title=Weaving the Web |publisher=HarperSanFrancisco |year=1999 |pages=chapter 12 |isbn=978-0-06-251587-2 |nopp=true }}&amp;lt;/ref&amp;gt;&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For this vision to be true, information arguably needs to be structured, maybe even classified. The idea of a universal information classification system has been floated. The modern web is mostly developed by software developers and similar, not librarians and the like.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- TODO: Yahoo blurb. --&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Also, how does one differentiate satire from fact?&lt;br /&gt;
&lt;br /&gt;
==== Valuation and deduplication of information ====&lt;br /&gt;
Another problem common with the current wwww is the duplication of information. Redundancy is not in itself harmful to increase the availability of information, but is ad-hoc duplication of the information itself?&lt;br /&gt;
&lt;br /&gt;
One then comes to the problem of assigning a value to the information found therein. How does one rate information, and according to what criteria? How does one authenticate the information? Often, popularity is used as an indicator of veracity, almost in a sophistic manner. See excessive reliance on Google page ranking or Reddit score for various types of information consumption for research or news consumption respectively.&lt;br /&gt;
&lt;br /&gt;
=== On the current infrastructure ===&lt;br /&gt;
The current &amp;lt;em&amp;gt;internet&amp;lt;/em&amp;gt; infrastructure should remain as is, at least in countries with not just a modicum of freedom of access to information. Centralization of of control of access to information is a terrible power. See China and parts of the Middle-East. On that note, what can be said of popular sites, such as Google or Wikipedia that serve as the main entry point for many access patterns?&lt;br /&gt;
&lt;br /&gt;
The problem, if any, in the current web infrastructure is of the web itself, not the internet.&lt;br /&gt;
&lt;br /&gt;
=== Group 3 ===&lt;br /&gt;
* What we want to keep &lt;br /&gt;
** Linking mechanisms&lt;br /&gt;
** Minimum permissions to publish&lt;br /&gt;
* What we don&#039;t like&lt;br /&gt;
** Relying on one source for document &lt;br /&gt;
** Privacy links for security&lt;br /&gt;
* Proposal &lt;br /&gt;
** Peer-peer to distributed mechanisms for documenting&lt;br /&gt;
** Reverse links with caching - distributed cache&lt;br /&gt;
** More availability for user - what happens when system fails? &lt;br /&gt;
** Key management to be considered - Is it good to have centralized or distributed mechanism? &lt;br /&gt;
&lt;br /&gt;
=== Group 4 ===&lt;br /&gt;
* An idea of web searching for us &lt;br /&gt;
* A suggestion of a different web if it would have been implemented by &amp;quot;AI&amp;quot; people&lt;br /&gt;
** AI programs searching for data - A notion already being implemented by Google slowly.&lt;br /&gt;
* Generate report forums&lt;br /&gt;
* HTML equivalent is inspired by the AI communication&lt;br /&gt;
* Higher semantics apart from just indexing the data&lt;br /&gt;
** Problem : &amp;quot;How to bridge the semantic gap?&amp;quot;&lt;br /&gt;
** Search for more data patterns&lt;br /&gt;
&lt;br /&gt;
= Group design exercise — The web that could be =&lt;br /&gt;
&lt;br /&gt;
* “The web that wasn&#039;t” mentioned the moans of librarians.&lt;br /&gt;
* A universal classification system is needed.&lt;br /&gt;
* The training overhead of classifiers (e.g., librarians) is high. See the master&#039;s that a librarian would need.&lt;br /&gt;
* More structured content, both classification, and organization&lt;br /&gt;
* Current indexing by crude brute-force searching for words, etc., rather than searching metadata&lt;br /&gt;
* Information doesn&#039;t have the same persistence, see bitrot and Vint Cerf&#039;s talk.&lt;br /&gt;
* Too concerned with presentation now.&lt;br /&gt;
* Tim Berner-Lees bemoaning the death of the semantic web.&lt;br /&gt;
* The problem of information duplication when information gets redistributed across the web. However, we do want redundancy.&lt;br /&gt;
* Too much developed by software developers&lt;br /&gt;
* Too reliant on Google for web structure&lt;br /&gt;
** See search-engine optimization&lt;br /&gt;
* Problem of authentication (of the information, not the presenter)&lt;br /&gt;
** Too dependent at times on the popularity of a site, almost in a sophistic manner.&lt;br /&gt;
** See Reddit&lt;br /&gt;
* How do you programmatically distinguish satire from fact&lt;br /&gt;
* The web&#039;s structure is also “shaped by inbound links but would be nice a bit more”&lt;br /&gt;
* Infrastructure doesn&#039;t need to change per se.&lt;br /&gt;
** The distributed architecture should still stay. Centralization of control of allowed information and access is terrible power. See China and the Middle-East.&lt;br /&gt;
** Information, for the most part, in itself, exists centrally (as per-page), though communities (to use a generic term) are distributed.&lt;br /&gt;
* Need more sophisticated natural language processing.&lt;br /&gt;
&lt;br /&gt;
= Class discussion =&lt;br /&gt;
&lt;br /&gt;
Focusing on vision, not the mechanism.&lt;br /&gt;
&lt;br /&gt;
* Reverse linking&lt;br /&gt;
* Distributed content distribution (glorified cache)&lt;br /&gt;
** Both for privacy and redunancy reasons&lt;br /&gt;
** Suggested centralized content certification, but doesn&#039;t address the problem of root of trust and distributed consistency checking.&lt;br /&gt;
*** Distributed key management is a holy grail&lt;br /&gt;
*** What about detecting large-scale subversion attempts, like in China&lt;br /&gt;
* What is the new revenue model?&lt;br /&gt;
** What was TBL&#039;s revenue model (tongue-in-cheek, none)?&lt;br /&gt;
** Organisations like Google monetized the internet, and this mechanism could destroy their ability to do so.&lt;br /&gt;
* Search work is semi-distributed. Suggested letting the web do the work for you.&lt;br /&gt;
* Trying to structure content in a manner simultaneously palatable to both humans and machines.&lt;br /&gt;
* Using spare CPU time on servers for natural language processing (or other AI) of cached or locally available resources.&lt;br /&gt;
* Imagine a smushed Wolfram Alpha, Google, Wikipedia, and Watson, and then distributed over the net.&lt;br /&gt;
* The document was TBL&#039;s idea of the atom of content, whereas nowaday we really need something more granular.&lt;br /&gt;
* We want to extract higher-level semantics.&lt;br /&gt;
* Google may not be pure keyword search anymore. It is essentially AI now, but we still struggle with expressing what we want to Google.&lt;br /&gt;
* What about the adversarial aspect of content hosters, vying for attention?&lt;br /&gt;
* People do actively try to fool you.&lt;br /&gt;
* Compare to Google News, though that is very specific to that domain. Their vision is a semantic web, but they are incrementally building it.&lt;br /&gt;
* In a scary fashion, Google is one of the central points of failure of the web. Even scarier is less technically competent people who depend on Facebook for that.&lt;br /&gt;
* There is a semantic gap between how we express and query information, and how AI understands it.&lt;br /&gt;
* Can think of Facebook as a distributed human search infrastructure.&lt;br /&gt;
* A core service of an operating system is locating information. &#039;&#039;&#039;Search is infrastructure.&#039;&#039;&#039;&lt;br /&gt;
* The problem is not purely technical. There are political and social aspects.&lt;br /&gt;
** Searching for a file on a local filesystem should have a unambiguous answer.&lt;br /&gt;
** Asking the web is a different thing. “What is the best chocolate bar?”&lt;br /&gt;
* Is the web a network database, as understood in COMP 3005, which we consider harmful.&lt;br /&gt;
* For two-way links, there is the problem of restructuring data and all the dependencies.&lt;br /&gt;
* Privacy issues when tracing paths across the web.&lt;br /&gt;
* What about the problem of information revocation?&lt;br /&gt;
* Need more augmented reality and distributed and micro payment systems.&lt;br /&gt;
* We need distributed, mutually untrusting social networks.&lt;br /&gt;
** Now we have the problem of storage and computation, but also take away some of of the monetizationable aspect.&lt;br /&gt;
* Distribution is not free. It is very expensive in very funny ways.&lt;br /&gt;
* The dream of harvesting all the computational power of the internet is not new.&lt;br /&gt;
** Startups have come and gone many times over that problem.&lt;br /&gt;
* Google&#039;s indexers understands quite well many documents on the web. However, it only &#039;&#039;&#039;presents&#039;&#039;&#039; a primitive keyword-like interface. It doesn&#039;t expose the ontology.&lt;br /&gt;
* Organising information does not necessarily mean applying an ontology to it.&lt;br /&gt;
* The organisational methods we now use don&#039;t use ontologies, but rather are supplemented by them.&lt;/div&gt;</summary>
		<author><name>Alp</name></author>
	</entry>
	<entry>
		<id>https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_6&amp;diff=18485</id>
		<title>DistOS 2014W Lecture 6</title>
		<link rel="alternate" type="text/html" href="https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_6&amp;diff=18485"/>
		<updated>2014-01-24T03:07:02Z</updated>

		<summary type="html">&lt;p&gt;Alp: Full integration of group-private discussion notes.&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
&lt;br /&gt;
== Group Discussion on &amp;quot;The Early Web ==&lt;br /&gt;
&lt;br /&gt;
Questions to discuss:&lt;br /&gt;
&lt;br /&gt;
# How do you think the web would have been if not like the present way? &lt;br /&gt;
# What kind of infrastructure changes would you like to make? &lt;br /&gt;
&lt;br /&gt;
=== Group 1 ===&lt;br /&gt;
: Relatively satisfied with the present structure of the web some changes suggested are in the below areas: &lt;br /&gt;
* Make use of the greater potential of Protocols &lt;br /&gt;
* More communication and interaction capabilities.&lt;br /&gt;
* Implementation changes in the present payment method systems. Example usage of &amp;quot;Micro-computation&amp;quot; - a discussion we would get back to in future classes. Also, Cryptographic currencies.&lt;br /&gt;
* Augmented reality.&lt;br /&gt;
* More towards individual privacy. &lt;br /&gt;
&lt;br /&gt;
=== Group 2 ===&lt;br /&gt;
==== Problem of unstructured information ====&lt;br /&gt;
A large portion of the web serves content that is overwhelmingly concerned about presentation rather than structuring content. Tim Berner-Lees himself bemoaned the death of the semantic web. His original vision of it was as follows:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- Code from Wikipedia&#039;s article on the semantic web, except for the block quoting form, which this MediaWiki instance doens&#039;t seem to support. --&amp;gt;&lt;br /&gt;
&amp;lt;blockquote&amp;gt;I have a dream for the Web [in which computers] become capable of analyzing all the data on the Web&amp;amp;nbsp;– the content, links, and transactions between people and computers. A &amp;quot;Semantic Web&amp;quot;, which makes this possible, has yet to emerge, but when it does, the day-to-day mechanisms of trade, bureaucracy and our daily lives will be handled by machines talking to machines. The &amp;quot;intelligent agents&amp;quot; people have touted for ages will finally materialize.&amp;lt;ref&amp;gt;{{cite book |last=Berners-Lee |first=Tim |authorlink=Tim Berners-Lee |coauthors=Fischetti, Mark |title=Weaving the Web |publisher=HarperSanFrancisco |year=1999 |pages=chapter 12 |isbn=978-0-06-251587-2 |nopp=true }}&amp;lt;/ref&amp;gt;&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For this vision to be true, information arguably needs to be structured, maybe even classified. The idea of a universal information classification system has been floated. The modern web is mostly developed by software developers and similar, not librarians and the like.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- TODO: Yahoo blurb. --&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Also, how does one differentiate satire from fact?&lt;br /&gt;
&lt;br /&gt;
==== Valuation and deduplication of information ====&lt;br /&gt;
Another problem common with the current wwww is the duplication of information. Redundancy is not in itself harmful to increase the availability of information, but is ad-hoc duplication of the information itself?&lt;br /&gt;
&lt;br /&gt;
One then comes to the problem of assigning a value to the information found therein. How does one rate information, and according to what criteria? How does one authenticate the information? Often, popularity is used as an indicator of veracity, almost in a sophistic manner. See excessive reliance on Google page ranking or Reddit score for various types of information consumption for research or news consumption respectively.&lt;br /&gt;
&lt;br /&gt;
=== On the current infrastructure ===&lt;br /&gt;
The current &amp;lt;em&amp;gt;internet&amp;lt;/em&amp;gt; infrastructure should remain as is, at least in countries with not just a modicum of freedom of access to information. Centralization of of control of access to information is a terrible power. See China and parts of the Middle-East. On that note, what can be said of popular sites, such as Google or Wikipedia that serve as the main entry point for many access patterns?&lt;br /&gt;
&lt;br /&gt;
The problem, if any, in the current web infrastructure is of the web itself, not the internet.&lt;br /&gt;
&lt;br /&gt;
=== Group 3 ===&lt;br /&gt;
* What we want to keep &lt;br /&gt;
** Linking mechanisms&lt;br /&gt;
** Minimum permissions to publish&lt;br /&gt;
* What we don&#039;t like&lt;br /&gt;
** Relying on one source for document &lt;br /&gt;
** Privacy links for security&lt;br /&gt;
* Proposal &lt;br /&gt;
** Peer-peer to distributed mechanisms for documenting&lt;br /&gt;
** Reverse links with caching - distributed cache&lt;br /&gt;
** More availability for user - what happens when system fails? &lt;br /&gt;
** Key management to be considered - Is it good to have centralized or distributed mechanism? &lt;br /&gt;
&lt;br /&gt;
=== Group 4 ===&lt;br /&gt;
* An idea of web searching for us &lt;br /&gt;
* A suggestion of a different web if it would have been implemented by &amp;quot;AI&amp;quot; people&lt;br /&gt;
** AI programs searching for data - A notion already being implemented by Google slowly.&lt;br /&gt;
* Generate report forums&lt;br /&gt;
* HTML equivalent is inspired by the AI communication&lt;br /&gt;
* Higher semantics apart from just indexing the data&lt;br /&gt;
** Problem : &amp;quot;How to bridge the semantic gap?&amp;quot;&lt;br /&gt;
** Search for more data patterns&lt;br /&gt;
&lt;br /&gt;
= Group design exercise — The web that could be =&lt;br /&gt;
&lt;br /&gt;
* “The web that wasn&#039;t” mentioned the moans of librarians.&lt;br /&gt;
* A universal classification system is needed.&lt;br /&gt;
* The training overhead of classifiers (e.g., librarians) is high. See the master&#039;s that a librarian would need.&lt;br /&gt;
* More structured content, both classification, and organization&lt;br /&gt;
* Current indexing by crude brute-force searching for words, etc., rather than searching metadata&lt;br /&gt;
* Information doesn&#039;t have the same persistence, see bitrot and Vint Cerf&#039;s talk.&lt;br /&gt;
* Too concerned with presentation now.&lt;br /&gt;
* Tim Berner-Lees bemoaning the death of the semantic web.&lt;br /&gt;
* The problem of information duplication when information gets redistributed across the web. However, we do want redundancy.&lt;br /&gt;
* Too much developed by software developers&lt;br /&gt;
* Too reliant on Google for web structure&lt;br /&gt;
** See search-engine optimization&lt;br /&gt;
* Problem of authentication (of the information, not the presenter)&lt;br /&gt;
** Too dependent at times on the popularity of a site, almost in a sophistic manner.&lt;br /&gt;
** See Reddit&lt;br /&gt;
* How do you programmatically distinguish satire from fact&lt;br /&gt;
* The web&#039;s structure is also “shaped by inbound links but would be nice a bit more”&lt;br /&gt;
* Infrastructure doesn&#039;t need to change per se.&lt;br /&gt;
** The distributed architecture should still stay. Centralization of control of allowed information and access is terrible power. See China and the Middle-East.&lt;br /&gt;
** Information, for the most part, in itself, exists centrally (as per-page), though communities (to use a generic term) are distributed.&lt;br /&gt;
* Need more sophisticated natural language processing.&lt;br /&gt;
&lt;br /&gt;
= Class discussion =&lt;br /&gt;
&lt;br /&gt;
Focusing on vision, not the mechanism.&lt;br /&gt;
&lt;br /&gt;
* Reverse linking&lt;br /&gt;
* Distributed content distribution (glorified cache)&lt;br /&gt;
** Both for privacy and redunancy reasons&lt;br /&gt;
** Suggested centralized content certification, but doesn&#039;t address the problem of root of trust and distributed consistency checking.&lt;br /&gt;
*** Distributed key management is a holy grail&lt;br /&gt;
*** What about detecting large-scale subversion attempts, like in China&lt;br /&gt;
* What is the new revenue model?&lt;br /&gt;
** What was TBL&#039;s revenue model (tongue-in-cheek, none)?&lt;br /&gt;
** Organisations like Google monetized the internet, and this mechanism could destroy their ability to do so.&lt;br /&gt;
* Search work is semi-distributed. Suggested letting the web do the work for you.&lt;br /&gt;
* Trying to structure content in a manner simultaneously palatable to both humans and machines.&lt;br /&gt;
* Using spare CPU time on servers for natural language processing (or other AI) of cached or locally available resources.&lt;br /&gt;
* Imagine a smushed Wolfram Alpha, Google, Wikipedia, and Watson, and then distributed over the net.&lt;br /&gt;
* The document was TBL&#039;s idea of the atom of content, whereas nowaday we really need something more granular.&lt;br /&gt;
* We want to extract higher-level semantics.&lt;br /&gt;
* Google may not be pure keyword search anymore. It is essentially AI now, but we still struggle with expressing what we want to Google.&lt;br /&gt;
* What about the adversarial aspect of content hosters, vying for attention?&lt;br /&gt;
* People do actively try to fool you.&lt;br /&gt;
* Compare to Google News, though that is very specific to that domain. Their vision is a semantic web, but they are incrementally building it.&lt;br /&gt;
* In a scary fashion, Google is one of the central points of failure of the web. Even scarier is less technically competent people who depend on Facebook for that.&lt;br /&gt;
* There is a semantic gap between how we express and query information, and how AI understands it.&lt;br /&gt;
* Can think of Facebook as a distributed human search infrastructure.&lt;br /&gt;
* A core service of an operating system is locating information. &#039;&#039;&#039;Search is infrastructure.&#039;&#039;&#039;&lt;br /&gt;
* The problem is not purely technical. There are political and social aspects.&lt;br /&gt;
** Searching for a file on a local filesystem should have a unambiguous answer.&lt;br /&gt;
** Asking the web is a different thing. “What is the best chocolate bar?”&lt;br /&gt;
* Is the web a network database, as understood in COMP 3005, which we consider harmful.&lt;br /&gt;
* For two-way links, there is the problem of restructuring data and all the dependencies.&lt;br /&gt;
* Privacy issues when tracing paths across the web.&lt;br /&gt;
* What about the problem of information revocation?&lt;br /&gt;
* Need more augmented reality and distributed and micro payment systems.&lt;br /&gt;
* We need distributed, mutually untrusting social networks.&lt;br /&gt;
** Now we have the problem of storage and computation, but also take away some of of the monetizationable aspect.&lt;br /&gt;
* Distribution is not free. It is very expensive in very funny ways.&lt;br /&gt;
* The dream of harvesting all the computational power of the internet is not new.&lt;br /&gt;
** Startups have come and gone many times over that problem.&lt;br /&gt;
* Google&#039;s indexers understands quite well many documents on the web. However, it only &#039;&#039;&#039;presents&#039;&#039;&#039; a primitive keyword-like interface. It doesn&#039;t expose the ontology.&lt;br /&gt;
* Organising information does not necessarily mean applying an ontology to it.&lt;br /&gt;
* The organisational methods we now use don&#039;t use ontologies, but rather are supplemented by them.&lt;/div&gt;</summary>
		<author><name>Alp</name></author>
	</entry>
	<entry>
		<id>https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_6&amp;diff=18484</id>
		<title>DistOS 2014W Lecture 6</title>
		<link rel="alternate" type="text/html" href="https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_6&amp;diff=18484"/>
		<updated>2014-01-24T02:14:39Z</updated>

		<summary type="html">&lt;p&gt;Alp: Appending raw dump of my notes (to edit later)&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
&lt;br /&gt;
== Group Discussion on &amp;quot;The Early Web ==&lt;br /&gt;
&lt;br /&gt;
Questions to discuss:&lt;br /&gt;
&lt;br /&gt;
# How do you think the web would have been if not like the present way? &lt;br /&gt;
# What kind of infrastructure changes would you like to make? &lt;br /&gt;
&lt;br /&gt;
=== Group 1 ===&lt;br /&gt;
: Relatively satisfied with the present structure of the web some changes suggested are in the below areas: &lt;br /&gt;
* Make use of the greater potential of Protocols &lt;br /&gt;
* More communication and interaction capabilities.&lt;br /&gt;
* Implementation changes in the present payment method systems. Example usage of &amp;quot;Micro-computation&amp;quot; - a discussion we would get back to in future classes. Also, Cryptographic currencies.&lt;br /&gt;
* Augmented reality.&lt;br /&gt;
* More towards individual privacy. &lt;br /&gt;
&lt;br /&gt;
=== Group 2 ===&lt;br /&gt;
A large portion of the web serves content that is overwhelmingly concerned about presentation rather than structuring content. Tim Berner-Lees himself bemoaned the death of the semantic web.&lt;br /&gt;
&lt;br /&gt;
* Information to be classified in detail &lt;br /&gt;
** Organize things on web. Ex: Yahoo indexers&lt;br /&gt;
** Suggestion for the need of Universal Decimal System an idea by Paul Otlet to be considered. &lt;br /&gt;
** In the end it comes to semantic web&lt;br /&gt;
* Information redundancy&lt;br /&gt;
* Information verification&lt;br /&gt;
&lt;br /&gt;
=== Group 3 ===&lt;br /&gt;
* What we want to keep &lt;br /&gt;
** Linking mechanisms&lt;br /&gt;
** Minimum permissions to publish&lt;br /&gt;
* What we don&#039;t like&lt;br /&gt;
** Relying on one source for document &lt;br /&gt;
** Privacy links for security&lt;br /&gt;
* Proposal &lt;br /&gt;
** Peer-peer to distributed mechanisms for documenting&lt;br /&gt;
** Reverse links with caching - distributed cache&lt;br /&gt;
** More availability for user - what happens when system fails? &lt;br /&gt;
** Key management to be considered - Is it good to have centralized or distributed mechanism? &lt;br /&gt;
&lt;br /&gt;
=== Group 4 ===&lt;br /&gt;
* An idea of web searching for us &lt;br /&gt;
* A suggestion of a different web if it would have been implemented by &amp;quot;AI&amp;quot; people&lt;br /&gt;
** AI programs searching for data - A notion already being implemented by Google slowly.&lt;br /&gt;
* Generate report forums&lt;br /&gt;
* HTML equivalent is inspired by the AI communication&lt;br /&gt;
* Higher semantics apart from just indexing the data&lt;br /&gt;
** Problem : &amp;quot;How to bridge the semantic gap?&amp;quot;&lt;br /&gt;
** Search for more data patterns&lt;br /&gt;
&lt;br /&gt;
= Group design exercise — The web that could be =&lt;br /&gt;
&lt;br /&gt;
* “The web that wasn&#039;t” mentioned the moans of librarians.&lt;br /&gt;
* A universal classification system is needed.&lt;br /&gt;
* The training overhead of classifiers (e.g., librarians) is high. See the master&#039;s that a librarian would need.&lt;br /&gt;
* More structured content, both classification, and organization&lt;br /&gt;
* Current indexing by crude brute-force searching for words, etc., rather than searching metadata&lt;br /&gt;
* Information doesn&#039;t have the same persistence, see bitrot and Vint Cerf&#039;s talk.&lt;br /&gt;
* Too concerned with presentation now.&lt;br /&gt;
* Tim Berner-Lees bemoaning the death of the semantic web.&lt;br /&gt;
* The problem of information duplication when information gets redistributed across the web. However, we do want redundancy.&lt;br /&gt;
* Too much developed by software developers&lt;br /&gt;
* Too reliant on Google for web structure&lt;br /&gt;
** See search-engine optimization&lt;br /&gt;
* Problem of authentication (of the information, not the presenter)&lt;br /&gt;
** Too dependent at times on the popularity of a site, almost in a sophistic manner.&lt;br /&gt;
** See Reddit&lt;br /&gt;
* How do you programmatically distinguish satire from fact&lt;br /&gt;
* The web&#039;s structure is also “shaped by inbound links but would be nice a bit more”&lt;br /&gt;
* Infrastructure doesn&#039;t need to change per se.&lt;br /&gt;
** The distributed architecture should still stay. Centralization of control of allowed information and access is terrible power. See China and the Middle-East.&lt;br /&gt;
** Information, for the most part, in itself, exists centrally (as per-page), though communities (to use a generic term) are distributed.&lt;br /&gt;
* Need more sophisticated natural language processing.&lt;br /&gt;
&lt;br /&gt;
= Class discussion =&lt;br /&gt;
&lt;br /&gt;
Focusing on vision, not the mechanism.&lt;br /&gt;
&lt;br /&gt;
* Reverse linking&lt;br /&gt;
* Distributed content distribution (glorified cache)&lt;br /&gt;
** Both for privacy and redunancy reasons&lt;br /&gt;
** Suggested centralized content certification, but doesn&#039;t address the problem of root of trust and distributed consistency checking.&lt;br /&gt;
*** Distributed key management is a holy grail&lt;br /&gt;
*** What about detecting large-scale subversion attempts, like in China&lt;br /&gt;
* What is the new revenue model?&lt;br /&gt;
** What was TBL&#039;s revenue model (tongue-in-cheek, none)?&lt;br /&gt;
** Organisations like Google monetized the internet, and this mechanism could destroy their ability to do so.&lt;br /&gt;
* Search work is semi-distributed. Suggested letting the web do the work for you.&lt;br /&gt;
* Trying to structure content in a manner simultaneously palatable to both humans and machines.&lt;br /&gt;
* Using spare CPU time on servers for natural language processing (or other AI) of cached or locally available resources.&lt;br /&gt;
* Imagine a smushed Wolfram Alpha, Google, Wikipedia, and Watson, and then distributed over the net.&lt;br /&gt;
* The document was TBL&#039;s idea of the atom of content, whereas nowaday we really need something more granular.&lt;br /&gt;
* We want to extract higher-level semantics.&lt;br /&gt;
* Google may not be pure keyword search anymore. It is essentially AI now, but we still struggle with expressing what we want to Google.&lt;br /&gt;
* What about the adversarial aspect of content hosters, vying for attention?&lt;br /&gt;
* People do actively try to fool you.&lt;br /&gt;
* Compare to Google News, though that is very specific to that domain. Their vision is a semantic web, but they are incrementally building it.&lt;br /&gt;
* In a scary fashion, Google is one of the central points of failure of the web. Even scarier is less technically competent people who depend on Facebook for that.&lt;br /&gt;
* There is a semantic gap between how we express and query information, and how AI understands it.&lt;br /&gt;
* Can think of Facebook as a distributed human search infrastructure.&lt;br /&gt;
* A core service of an operating system is locating information. &#039;&#039;&#039;Search is infrastructure.&#039;&#039;&#039;&lt;br /&gt;
* The problem is not purely technical. There are political and social aspects.&lt;br /&gt;
** Searching for a file on a local filesystem should have a unambiguous answer.&lt;br /&gt;
** Asking the web is a different thing. “What is the best chocolate bar?”&lt;br /&gt;
* Is the web a network database, as understood in COMP 3005, which we consider harmful.&lt;br /&gt;
* For two-way links, there is the problem of restructuring data and all the dependencies.&lt;br /&gt;
* Privacy issues when tracing paths across the web.&lt;br /&gt;
* What about the problem of information revocation?&lt;br /&gt;
* Need more augmented reality and distributed and micro payment systems.&lt;br /&gt;
* We need distributed, mutually untrusting social networks.&lt;br /&gt;
** Now we have the problem of storage and computation, but also take away some of of the monetizationable aspect.&lt;br /&gt;
* Distribution is not free. It is very expensive in very funny ways.&lt;br /&gt;
* The dream of harvesting all the computational power of the internet is not new.&lt;br /&gt;
** Startups have come and gone many times over that problem.&lt;br /&gt;
* Google&#039;s indexers understands quite well many documents on the web. However, it only &#039;&#039;&#039;presents&#039;&#039;&#039; a primitive keyword-like interface. It doesn&#039;t expose the ontology.&lt;br /&gt;
* Organising information does not necessarily mean applying an ontology to it.&lt;br /&gt;
* The organisational methods we now use don&#039;t use ontologies, but rather are supplemented by them.&lt;/div&gt;</summary>
		<author><name>Alp</name></author>
	</entry>
	<entry>
		<id>https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_6&amp;diff=18483</id>
		<title>DistOS 2014W Lecture 6</title>
		<link rel="alternate" type="text/html" href="https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_6&amp;diff=18483"/>
		<updated>2014-01-24T02:12:37Z</updated>

		<summary type="html">&lt;p&gt;Alp: Convert previous content to more idiomatic markup (see pun on semantic web)&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
&lt;br /&gt;
== Group Discussion on &amp;quot;The Early Web ==&lt;br /&gt;
&lt;br /&gt;
Questions to discuss:&lt;br /&gt;
&lt;br /&gt;
# How do you think the web would have been if not like the present way? &lt;br /&gt;
# What kind of infrastructure changes would you like to make? &lt;br /&gt;
&lt;br /&gt;
=== Group 1 ===&lt;br /&gt;
: Relatively satisfied with the present structure of the web some changes suggested are in the below areas: &lt;br /&gt;
* Make use of the greater potential of Protocols &lt;br /&gt;
* More communication and interaction capabilities.&lt;br /&gt;
* Implementation changes in the present payment method systems. Example usage of &amp;quot;Micro-computation&amp;quot; - a discussion we would get back to in future classes. Also, Cryptographic currencies.&lt;br /&gt;
* Augmented reality.&lt;br /&gt;
* More towards individual privacy. &lt;br /&gt;
&lt;br /&gt;
=== Group 2 ===&lt;br /&gt;
A large portion of the web serves content that is overwhelmingly concerned about presentation rather than structuring content. Tim Berner-Lees himself bemoaned the death of the semantic web.&lt;br /&gt;
&lt;br /&gt;
* Information to be classified in detail &lt;br /&gt;
** Organize things on web. Ex: Yahoo indexers&lt;br /&gt;
** Suggestion for the need of Universal Decimal System an idea by Paul Otlet to be considered. &lt;br /&gt;
** In the end it comes to semantic web&lt;br /&gt;
* Information redundancy&lt;br /&gt;
* Information verification&lt;br /&gt;
&lt;br /&gt;
=== Group 3 ===&lt;br /&gt;
* What we want to keep &lt;br /&gt;
** Linking mechanisms&lt;br /&gt;
** Minimum permissions to publish&lt;br /&gt;
* What we don&#039;t like&lt;br /&gt;
** Relying on one source for document &lt;br /&gt;
** Privacy links for security&lt;br /&gt;
* Proposal &lt;br /&gt;
** Peer-peer to distributed mechanisms for documenting&lt;br /&gt;
** Reverse links with caching - distributed cache&lt;br /&gt;
** More availability for user - what happens when system fails? &lt;br /&gt;
** Key management to be considered - Is it good to have centralized or distributed mechanism? &lt;br /&gt;
&lt;br /&gt;
=== Group 4 ===&lt;br /&gt;
* An idea of web searching for us &lt;br /&gt;
* A suggestion of a different web if it would have been implemented by &amp;quot;AI&amp;quot; people&lt;br /&gt;
** AI programs searching for data - A notion already being implemented by Google slowly.&lt;br /&gt;
* Generate report forums&lt;br /&gt;
* HTML equivalent is inspired by the AI communication&lt;br /&gt;
* Higher semantics apart from just indexing the data&lt;br /&gt;
** Problem : &amp;quot;How to bridge the semantic gap?&amp;quot;&lt;br /&gt;
** Search for more data patterns&lt;/div&gt;</summary>
		<author><name>Alp</name></author>
	</entry>
	<entry>
		<id>https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_6&amp;diff=18482</id>
		<title>DistOS 2014W Lecture 6</title>
		<link rel="alternate" type="text/html" href="https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_6&amp;diff=18482"/>
		<updated>2014-01-24T02:11:07Z</updated>

		<summary type="html">&lt;p&gt;Alp: Snippet from my notes.&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
&lt;br /&gt;
== Group Discussion on &amp;quot;The Early Web ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Questions to discuss:&lt;br /&gt;
&lt;br /&gt;
# How do you think the web would have been if not like the present way? &lt;br /&gt;
: 2. What kind of infrastructure changes would you like to make? &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Group 1&#039;&#039;&#039;&lt;br /&gt;
: Relatively satisfied with the present structure of the web some changes suggested are in the below areas: &lt;br /&gt;
* Make use of the greater potential of Protocols &lt;br /&gt;
* More communication and interaction capabilities.&lt;br /&gt;
* Implementation changes in the present payment method systems. Example usage of &amp;quot;Micro-computation&amp;quot; - a discussion we would get back to in future classes. Also, Cryptographic currencies.&lt;br /&gt;
* Augmented reality.&lt;br /&gt;
* More towards individual privacy. &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Group 2&#039;&#039;&#039;&lt;br /&gt;
A large portion of the web serves content that is overwhelmingly concerned about presentation rather than structuring content. Tim Berner-Lees himself bemoaned the death of the semantic web.&lt;br /&gt;
&lt;br /&gt;
* Information to be classified in detail &lt;br /&gt;
** Organize things on web. Ex: Yahoo indexers&lt;br /&gt;
** Suggestion for the need of Universal Decimal System an idea by Paul Otlet to be considered. &lt;br /&gt;
** In the end it comes to semantic web&lt;br /&gt;
* Information redundancy&lt;br /&gt;
* Information verification&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Group 3&#039;&#039;&#039;&lt;br /&gt;
* What we want to keep &lt;br /&gt;
** Linking mechanisms&lt;br /&gt;
** Minimum permissions to publish&lt;br /&gt;
* What we don&#039;t like&lt;br /&gt;
** Relying on one source for document &lt;br /&gt;
** Privacy links for security&lt;br /&gt;
* Proposal &lt;br /&gt;
** Peer-peer to distributed mechanisms for documenting&lt;br /&gt;
** Reverse links with caching - distributed cache&lt;br /&gt;
** More availability for user - what happens when system fails? &lt;br /&gt;
** Key management to be considered - Is it good to have centralized or distributed mechanism? &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Group 4&#039;&#039;&#039;&lt;br /&gt;
* An idea of web searching for us &lt;br /&gt;
* A suggestion of a different web if it would have been implemented by &amp;quot;AI&amp;quot; people&lt;br /&gt;
** AI programs searching for data - A notion already being implemented by Google slowly.&lt;br /&gt;
* Generate report forums&lt;br /&gt;
* HTML equivalent is inspired by the AI communication&lt;br /&gt;
* Higher semantics apart from just indexing the data&lt;br /&gt;
** Problem : &amp;quot;How to bridge the semantic gap?&amp;quot;&lt;br /&gt;
** Search for more data patterns&lt;/div&gt;</summary>
		<author><name>Alp</name></author>
	</entry>
	<entry>
		<id>https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_5&amp;diff=18474</id>
		<title>DistOS 2014W Lecture 5</title>
		<link rel="alternate" type="text/html" href="https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_5&amp;diff=18474"/>
		<updated>2014-01-23T12:36:10Z</updated>

		<summary type="html">&lt;p&gt;Alp: Initial raw note dump, redacted&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Introduction =&lt;br /&gt;
; Operating system&lt;br /&gt;
: The software that turns the computer you have into the one you want (Anil)&lt;br /&gt;
&lt;br /&gt;
* What sort of computer did we want to have?&lt;br /&gt;
* What sort of abstractions did they want to be easy? Hard?&lt;br /&gt;
* What could we build with the internet (not just WAN, but also LAN)?&lt;br /&gt;
* Most dreams people had of their computers smacked into the wall of reality.&lt;br /&gt;
&lt;br /&gt;
= MOAD review in groups =&lt;br /&gt;
&lt;br /&gt;
* Chorded keyboard unfortunately obscure, partly because the attendees disagreed with the long-term investment of training the user.&lt;br /&gt;
* View control → hyperlinking system, but in a lightweight (more like nanoweight) markup language.&lt;br /&gt;
* Ad-hoc ticketing system&lt;br /&gt;
* Ad-hoc messaging system&lt;br /&gt;
** Used on a time-sharing systme with shared storage,&lt;br /&gt;
* Primitive revision control system&lt;br /&gt;
* Different vocabulary:&lt;br /&gt;
** Bug and bug smear (mouse and trail)&lt;br /&gt;
** Point rather than click&lt;br /&gt;
&lt;br /&gt;
= Class review =&lt;br /&gt;
&lt;br /&gt;
* Doug died Jul 2 2013&lt;br /&gt;
* Doug himself called it an “online system”, rather than offline composition of code using card punchers as was common in the day.&lt;br /&gt;
* What became of the tech:&lt;br /&gt;
** Chorded keyboards:&lt;br /&gt;
*** Exist but obscure&lt;br /&gt;
** Pre-ARPANET network:&lt;br /&gt;
*** Time-sharing mainframe&lt;br /&gt;
*** 13 workstations&lt;br /&gt;
*** Telephone and television circuit&lt;br /&gt;
** Mouse&lt;br /&gt;
*** “I sometimes apologize for calling it a mouse”&lt;br /&gt;
** Collaborative document editing integrated with screen sharing&lt;br /&gt;
** Videoconferencing&lt;br /&gt;
*** Part of the vision, but more for the demo at the time,&lt;br /&gt;
** Hyperlinks&lt;br /&gt;
*** The web on a mainframe&lt;br /&gt;
** Languages&lt;br /&gt;
*** Metalanguages&lt;br /&gt;
**** “Part and parcel of their entire vision of augmenting human intelligence.”&lt;br /&gt;
**** You must teach the computer about the language you are using.&lt;br /&gt;
**** They were the use case. It was almost designed more for augmenting programmer intelligence rather than human intelligence.&lt;br /&gt;
*** It was normal for the time to build new languages (domain-specific) for new systems. Nowadays, we standardize on one but develop large APIs, at the expense of conciseness. We look for short-term benefits; we minimize programmer effort.&lt;br /&gt;
*** Compiler compiler&lt;br /&gt;
** Freeze-pane&lt;br /&gt;
** Folding—Zoomable UI (ZUI)&lt;br /&gt;
*** Lots of systems do it, but not the default&lt;br /&gt;
*** Much easier to just present everything.&lt;br /&gt;
** Technologies the required further investment got left behind.&lt;br /&gt;
* The NLS had little to no security&lt;br /&gt;
** There was a minimal notion of a user&lt;br /&gt;
** There was a utopian aspect. Meanwhile, the Mac had no utopian aspect. Data exchange was through floppies. Any network was small, local, ad-hoc, and among trusted peers.&lt;br /&gt;
** The system wasn&#039;t envisioned to scale up to masses of people who didn&#039;t trust each other.&lt;br /&gt;
** How do you enforce secrecy.&lt;br /&gt;
* Part of the reason for lack of adoption of some of the tech was hardware. We can posit that a bigger reason would be infrastructure.&lt;br /&gt;
* Differentiate usability of system from usability of vision&lt;br /&gt;
** What was missing was the polish, the ‘sexiness’, and the intuitiveness of later systems like the Apple II and the Lisa.&lt;br /&gt;
** The usability of the later Alto is still less than commercial systems.&lt;br /&gt;
*** The word processor was modal, which is apt to confuse unmotivated and untrained users.&lt;br /&gt;
* In the context of the Mother of All Demos, the Alto doesn&#039;t seem entirely revolutionary. Xerox PARC raided his team. They almost had a GUI; rather they had what we call today a virtual console, with a few things above.&lt;br /&gt;
* What happens with visionaries that present a big vision is that the spectators latch onto specific aspects.&lt;br /&gt;
* To be comfortable with not adopting the vision, one must ostracize the visionary. People pay attention to things that fit into their world view.&lt;br /&gt;
* Use cases of networking have changed little, though the means did&lt;br /&gt;
* Fundamentally a resource-sharing system; everything is shared, unlike later systems where you would need to explicitly do so. Resources shared fundamentally sense to share: documents, printers, etc.&lt;br /&gt;
* Resource sharing was never enough. &#039;&#039;&#039;Information-sharing&#039;&#039;&#039; was the focus.&lt;br /&gt;
&lt;br /&gt;
= Alto review =&lt;br /&gt;
&lt;br /&gt;
* Fundamentally a personal computer&lt;br /&gt;
* Applications:&lt;br /&gt;
** Drawing program with curves and arcs for drawing&lt;br /&gt;
** Hardware design tools (mostly logic boards)&lt;br /&gt;
** Time server&lt;br /&gt;
* Less designed for reading than the NLS. More designed around paper. Xerox had a laser printer, and you would read what you printed. Hypertext was deprioritized, unlike the NLS vision had focused on what could not be expressed on paper.&lt;br /&gt;
* Xerox had almost an obsession with making documents print beautifully.&lt;/div&gt;</summary>
		<author><name>Alp</name></author>
	</entry>
	<entry>
		<id>https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_3&amp;diff=18391</id>
		<title>DistOS 2014W Lecture 3</title>
		<link rel="alternate" type="text/html" href="https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_3&amp;diff=18391"/>
		<updated>2014-01-14T15:47:56Z</updated>

		<summary type="html">&lt;p&gt;Alp: /* Unclear portions */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Questions to consider:&lt;br /&gt;
* What were the purposes envisioned for computer networks?  How do those compare with the uses they are put to today?&lt;br /&gt;
* What sort of resources were shared?  What resources are shared today?&lt;br /&gt;
* What network architecture did they envision?  Do we still have the same architecture?&lt;br /&gt;
* What surprised you about this paper?&lt;br /&gt;
* What was unclear?&lt;br /&gt;
&lt;br /&gt;
==Group 1==&lt;br /&gt;
* video was mostly a summary of Kahn&#039;s paper&lt;br /&gt;
* process migration through different zones of air traffic control&lt;br /&gt;
* &amp;quot;distributed OS&amp;quot; meant something different than we normally think about, because many people would log in remotely to a single machine, it is very much like cloud infrastructure that we talk about today&lt;br /&gt;
* alto paper makes reference to Kahn&#039;s paper, and the alto designers had the foresight to see that networks like arpanet would be necessary&lt;br /&gt;
* would it be useful to have a co-processor responsible for maintaining shared resources even today?  Like the IMPs of the arpanet?  Today, computers are usually so fast it doesn&#039;t really matter.&lt;br /&gt;
&lt;br /&gt;
=== Questions ===&lt;br /&gt;
&lt;br /&gt;
* What were the purposes envisioned for computer networks?&lt;br /&gt;
** big computation, storage, resource sharing - &amp;quot;having a library on a hard disk&amp;quot;&lt;br /&gt;
&lt;br /&gt;
* How do those compare with the uses they are put to today?&lt;br /&gt;
** those things are being done, but mostly communication like instant messaging, email&lt;br /&gt;
&lt;br /&gt;
* What sort of resources were shared?&lt;br /&gt;
** databases, CPU time&lt;br /&gt;
&lt;br /&gt;
* What resources are shared today?&lt;br /&gt;
** mostly storage&lt;br /&gt;
&lt;br /&gt;
* What network architecture did they envision?  &lt;br /&gt;
** they had a checksum and acknowledge on each packet&lt;br /&gt;
** the IMPs were the network interface and the routers&lt;br /&gt;
** packet-switching&lt;br /&gt;
&lt;br /&gt;
* Do we still have the same architecture?&lt;br /&gt;
** packet-switching definitely won&lt;br /&gt;
** no, now IP doesn&#039;t checksum or acknowledge, but TCP has end-to-end checksum and acknowledge&lt;br /&gt;
** Kahn went on to learn from the errors of arpanet to design TCP/IP&lt;br /&gt;
** the job of network interface and router have been decoupled&lt;br /&gt;
&lt;br /&gt;
* What surprised you about this paper?&lt;br /&gt;
** everything&lt;br /&gt;
** how they were able to do this&lt;br /&gt;
** a network interface card and router was the size of a fridge&lt;br /&gt;
** high-level languages&lt;br /&gt;
** bootstrap protocol, bootstrapping an application&lt;br /&gt;
** primitive computers&lt;br /&gt;
** desktop publishing&lt;br /&gt;
** the logistics of running a cable from one university to another&lt;br /&gt;
** how old the idea of distributed operating systems is&lt;br /&gt;
&lt;br /&gt;
* What was unclear?&lt;br /&gt;
** much of the more technical specifications, but we mostly skipped over those&lt;br /&gt;
&lt;br /&gt;
==Group 2==&lt;br /&gt;
1. The main purpose of early networks was resource sharing. Abstraction for transmission. Message reliability was a by-product. The underlying idea is the same.&lt;br /&gt;
&lt;br /&gt;
2. Specialized Hardware/software and information sharing. super set of sharing.&lt;br /&gt;
&lt;br /&gt;
3. AD-HOC routing, it was TCP without saying it. Largely unchanged today.&lt;br /&gt;
&lt;br /&gt;
==Group 3==&lt;br /&gt;
===Envisioned computer network purposes===&lt;br /&gt;
* Improving reliability of services, due to redundant resource sets&lt;br /&gt;
* Resource sharing&lt;br /&gt;
* Usage modes:t&lt;br /&gt;
** Users can use a remote terminal, from a remote office or home, to access those resources.&lt;br /&gt;
** Would allow centralization of resources, to improve ease of management and do away with inefficiencies&lt;br /&gt;
* Allow specialization of various sites. rather than each site trying to do it all&lt;br /&gt;
* Distributed simulations (notably air traffic control)&lt;br /&gt;
&lt;br /&gt;
Information-sharing is still relevant today, especially in research and large simulations. Remote access has mostly devolved into a specialized need.&lt;br /&gt;
&lt;br /&gt;
===Resources shared===&lt;br /&gt;
* Computing resources (especially expensive mainframes)&lt;br /&gt;
* Data sets&lt;br /&gt;
&lt;br /&gt;
===Network architecture===&lt;br /&gt;
* A primitive layered architecture&lt;br /&gt;
* Dedicated routing functions&lt;br /&gt;
* Various topologies:&lt;br /&gt;
** star&lt;br /&gt;
** loop&lt;br /&gt;
** bus&lt;br /&gt;
* Primarily (packet|mesage)-switched&lt;br /&gt;
** Circuit-switching too expensive and has large setup times&lt;br /&gt;
** Doesn&#039;t require committing resources&lt;br /&gt;
* Primitive flow control and buffering&lt;br /&gt;
* Predates proper congestion control such as Van Jacobsen&#039;s slow start&lt;br /&gt;
* Ad-hoc routing or based on something similar to RIP&lt;br /&gt;
* Anticipation of elephants and mice latency issues&lt;br /&gt;
* Unlike modern internet, error control and retransmission at every step&lt;br /&gt;
&lt;br /&gt;
The architecture today is similar, but the link-layer is very different: use of Ethernet and ATM. The modern internet is a collection of autonomous systems, not a single network. Routing propogation is now large-scale, and semi-automated (e.g., BGP externally, IS-IS and OSPF internally)&lt;br /&gt;
&lt;br /&gt;
===Surprising aspects===&lt;br /&gt;
&lt;br /&gt;
===Unclear portions===&lt;br /&gt;
* Weird packet format: Page 1400 (4 of PDF): “Node 6, discovering the message is for itself,&lt;br /&gt;
  replaces the destination address by the source address&lt;br /&gt;
&lt;br /&gt;
==Group 4==&lt;br /&gt;
&lt;br /&gt;
* What were the purposes envisioned for computer networks? How do those compare with the uses they are put to today?&lt;br /&gt;
&lt;br /&gt;
Networks were envisioned as providing remote access to other computers, because useful resources such as computing power, large databases, and non-portable software were local to a particular computer, not themselves shared over the network.&lt;br /&gt;
&lt;br /&gt;
Today, we use networks mostly for sharing data, although with services like Amazon AWS, we&#039;re starting to share computing resources again.  We&#039;re also moving to support collaboration (e.g. Google Docs, GitHub, etc.).&lt;br /&gt;
&lt;br /&gt;
* What sort of resources were shared? What resources are shared today?&lt;br /&gt;
&lt;br /&gt;
Computing power was the key resource being shared; today, it&#039;s access to data.  (See above.)&lt;br /&gt;
&lt;br /&gt;
* What network architecture did they envision? Do we still have the same architecture?&lt;br /&gt;
&lt;br /&gt;
Surprisingly, yes: modern networks have substantially similar architecures to the ones described in these papers.  &lt;br /&gt;
Packet-switched networks are now ubiquitous.  We no longer bother with circuit-switching even for telephony, in contrast to the assumption that non-network data would continue to use the circuit-switched common-carrier network.  &lt;br /&gt;
&lt;br /&gt;
* What surprised you about this paper?&lt;br /&gt;
&lt;br /&gt;
We were surprised by the accuracy of the predictions given how early the paper was written.  Also surprising were technological advances since the paper was written, such as data transfer speeds (we have networks that are faster than the integrated bus in the Alto), and the predicted resolution requirements (which we are nowhere near meeting).  The amount of detail in the description of the &#039;mouse pointing device&#039; was interesting too.&lt;br /&gt;
&lt;br /&gt;
* What was unclear? &lt;br /&gt;
&lt;br /&gt;
Nothing significant; we&#039;re looking at these with the benefit of hindsight.&lt;/div&gt;</summary>
		<author><name>Alp</name></author>
	</entry>
	<entry>
		<id>https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_3&amp;diff=18390</id>
		<title>DistOS 2014W Lecture 3</title>
		<link rel="alternate" type="text/html" href="https://homeostasis.scs.carleton.ca/wiki/index.php?title=DistOS_2014W_Lecture_3&amp;diff=18390"/>
		<updated>2014-01-14T15:43:13Z</updated>

		<summary type="html">&lt;p&gt;Alp: /* Group 3 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Questions to consider:&lt;br /&gt;
* What were the purposes envisioned for computer networks?  How do those compare with the uses they are put to today?&lt;br /&gt;
* What sort of resources were shared?  What resources are shared today?&lt;br /&gt;
* What network architecture did they envision?  Do we still have the same architecture?&lt;br /&gt;
* What surprised you about this paper?&lt;br /&gt;
* What was unclear?&lt;br /&gt;
&lt;br /&gt;
==Group 1==&lt;br /&gt;
* video was mostly a summary of Kahn&#039;s paper&lt;br /&gt;
* process migration through different zones of air traffic control&lt;br /&gt;
* &amp;quot;distributed OS&amp;quot; meant something different than we normally think about, because many people would log in remotely to a single machine, it is very much like cloud infrastructure that we talk about today&lt;br /&gt;
* alto paper makes reference to Kahn&#039;s paper, and the alto designers had the foresight to see that networks like arpanet would be necessary&lt;br /&gt;
* would it be useful to have a co-processor responsible for maintaining shared resources even today?  Like the IMPs of the arpanet?  Today, computers are usually so fast it doesn&#039;t really matter.&lt;br /&gt;
&lt;br /&gt;
=== Questions ===&lt;br /&gt;
&lt;br /&gt;
* What were the purposes envisioned for computer networks?&lt;br /&gt;
** big computation, storage, resource sharing - &amp;quot;having a library on a hard disk&amp;quot;&lt;br /&gt;
&lt;br /&gt;
* How do those compare with the uses they are put to today?&lt;br /&gt;
** those things are being done, but mostly communication like instant messaging, email&lt;br /&gt;
&lt;br /&gt;
* What sort of resources were shared?&lt;br /&gt;
** databases, CPU time&lt;br /&gt;
&lt;br /&gt;
* What resources are shared today?&lt;br /&gt;
** mostly storage&lt;br /&gt;
&lt;br /&gt;
* What network architecture did they envision?  &lt;br /&gt;
** they had a checksum and acknowledge on each packet&lt;br /&gt;
** the IMPs were the network interface and the routers&lt;br /&gt;
** packet-switching&lt;br /&gt;
&lt;br /&gt;
* Do we still have the same architecture?&lt;br /&gt;
** packet-switching definitely won&lt;br /&gt;
** no, now IP doesn&#039;t checksum or acknowledge, but TCP has end-to-end checksum and acknowledge&lt;br /&gt;
** Kahn went on to learn from the errors of arpanet to design TCP/IP&lt;br /&gt;
** the job of network interface and router have been decoupled&lt;br /&gt;
&lt;br /&gt;
* What surprised you about this paper?&lt;br /&gt;
** everything&lt;br /&gt;
** how they were able to do this&lt;br /&gt;
** a network interface card and router was the size of a fridge&lt;br /&gt;
** high-level languages&lt;br /&gt;
** bootstrap protocol, bootstrapping an application&lt;br /&gt;
** primitive computers&lt;br /&gt;
** desktop publishing&lt;br /&gt;
** the logistics of running a cable from one university to another&lt;br /&gt;
** how old the idea of distributed operating systems is&lt;br /&gt;
&lt;br /&gt;
* What was unclear?&lt;br /&gt;
** much of the more technical specifications, but we mostly skipped over those&lt;br /&gt;
&lt;br /&gt;
==Group 2==&lt;br /&gt;
1. The main purpose of early networks was resource sharing. Abstraction for transmission. Message reliability was a by-product. The underlying idea is the same.&lt;br /&gt;
&lt;br /&gt;
2. Specialized Hardware/software and information sharing. super set of sharing.&lt;br /&gt;
&lt;br /&gt;
3. AD-HOC routing, it was TCP without saying it. Largely unchanged today.&lt;br /&gt;
&lt;br /&gt;
==Group 3==&lt;br /&gt;
===Envisioned computer network purposes===&lt;br /&gt;
* Improving reliability of services, due to redundant resource sets&lt;br /&gt;
* Resource sharing&lt;br /&gt;
* Usage modes:t&lt;br /&gt;
** Users can use a remote terminal, from a remote office or home, to access those resources.&lt;br /&gt;
** Would allow centralization of resources, to improve ease of management and do away with inefficiencies&lt;br /&gt;
* Allow specialization of various sites. rather than each site trying to do it all&lt;br /&gt;
* Distributed simulations (notably air traffic control)&lt;br /&gt;
&lt;br /&gt;
Information-sharing is still relevant today, especially in research and large simulations. Remote access has mostly devolved into a specialized need.&lt;br /&gt;
&lt;br /&gt;
===Resources shared===&lt;br /&gt;
* Computing resources (especially expensive mainframes)&lt;br /&gt;
* Data sets&lt;br /&gt;
&lt;br /&gt;
===Network architecture===&lt;br /&gt;
* A primitive layered architecture&lt;br /&gt;
* Dedicated routing functions&lt;br /&gt;
* Various topologies:&lt;br /&gt;
** star&lt;br /&gt;
** loop&lt;br /&gt;
** bus&lt;br /&gt;
* Primarily (packet|mesage)-switched&lt;br /&gt;
** Circuit-switching too expensive and has large setup times&lt;br /&gt;
** Doesn&#039;t require committing resources&lt;br /&gt;
* Primitive flow control and buffering&lt;br /&gt;
* Predates proper congestion control such as Van Jacobsen&#039;s slow start&lt;br /&gt;
* Ad-hoc routing or based on something similar to RIP&lt;br /&gt;
* Anticipation of elephants and mice latency issues&lt;br /&gt;
* Unlike modern internet, error control and retransmission at every step&lt;br /&gt;
&lt;br /&gt;
The architecture today is similar, but the link-layer is very different: use of Ethernet and ATM. The modern internet is a collection of autonomous systems, not a single network. Routing propogation is now large-scale, and semi-automated (e.g., BGP externally, IS-IS and OSPF internally)&lt;br /&gt;
&lt;br /&gt;
===Surprising aspects===&lt;br /&gt;
&lt;br /&gt;
===Unclear portions===&lt;br /&gt;
* Weird packet format: Page 1400 (4 of PDF): “Node 6, discovering the message is for itself,&lt;br /&gt;
  replaces the destination address by the source address”.&lt;br /&gt;
&lt;br /&gt;
==Group 4==&lt;br /&gt;
&lt;br /&gt;
* What were the purposes envisioned for computer networks? How do those compare with the uses they are put to today?&lt;br /&gt;
&lt;br /&gt;
Networks were envisioned as providing remote access to other computers, because useful resources such as computing power, large databases, and non-portable software were local to a particular computer, not themselves shared over the network.&lt;br /&gt;
&lt;br /&gt;
Today, we use networks mostly for sharing data, although with services like Amazon AWS, we&#039;re starting to share computing resources again.  We&#039;re also moving to support collaboration (e.g. Google Docs, GitHub, etc.).&lt;br /&gt;
&lt;br /&gt;
* What sort of resources were shared? What resources are shared today?&lt;br /&gt;
&lt;br /&gt;
Computing power was the key resource being shared; today, it&#039;s access to data.  (See above.)&lt;br /&gt;
&lt;br /&gt;
* What network architecture did they envision? Do we still have the same architecture?&lt;br /&gt;
&lt;br /&gt;
Surprisingly, yes: modern networks have substantially similar architecures to the ones described in these papers.  &lt;br /&gt;
Packet-switched networks are now ubiquitous.  We no longer bother with circuit-switching even for telephony, in contrast to the assumption that non-network data would continue to use the circuit-switched common-carrier network.  &lt;br /&gt;
&lt;br /&gt;
* What surprised you about this paper?&lt;br /&gt;
&lt;br /&gt;
We were surprised by the accuracy of the predictions given how early the paper was written.  Also surprising were technological advances since the paper was written, such as data transfer speeds (we have networks that are faster than the integrated bus in the Alto), and the predicted resolution requirements (which we are nowhere near meeting).  The amount of detail in the description of the &#039;mouse pointing device&#039; was interesting too.&lt;br /&gt;
&lt;br /&gt;
* What was unclear? &lt;br /&gt;
&lt;br /&gt;
Nothing significant; we&#039;re looking at these with the benefit of hindsight.&lt;/div&gt;</summary>
		<author><name>Alp</name></author>
	</entry>
</feed>