The Internet And The Web
The internet and the web: are these just two different words for the same thing? As it turns out, no they're not, although these days the two terms often get used interchangeably. In this article I will attempt to explain what each of these words mean, and what the differences are between them.
The Internet - A Network of Networks
The principles on which the internet is based are very simple, and that is part of the reason why it has become so successful. In order to understand them, we're going to imagine that the internet does not exist yet, and we've been tasked with building it from scratch. Conveniently we happen to have a limitless supply of computers, cables and other bits and bobs lying around, so let's get started.
First, we're going to take a small group of ten computers and connect them all together with cables so that they can talk to one another. This construction, a group of computers in communication with one another, is called a network. While they're connected, they can send all sorts of information back and forth, including (but not limited to): webpages, emails, messages, images and videos. This seems quite useful, so let's create a few more groups like this, perhaps spread out over a few different geographical locations across the country, so that other groups of people can benefit.
So far, so good, but there's a limitation to this system. If someone is on network A and wants access to a file on network B, they're required to travel across to the physical location of network B in order to retrieve it. This isn't so great, and rather limits the use of the networks that we're building, so let's update things a bit. We're going to take a bundle of extremely long cables and use them to connect together all the disparate networks that are spread across the country, resulting in a much larger 'network of networks'. This is a term commonly used to describe the internet, but we haven't quite finished yet.
Now, we have expanded our network to include computers all over the country, but what about other nations? They have a lot of interesting stuff going on too, and we're quite keen on sharing things with them. Let's do that by getting some even larger (and more robust, some of these places are overseas) cables and using them to connect up all the different countries around the world. Excellent, we now have something that closely resembles the ARPANET, the precursor to the internet that was used to connect up academic institutions all over the world. To get to the modern day internet we add a few more techniques for sending information across the cables, allow pretty much anyone with a computer to connect to our network of networks, and then sit back to marvel at how rapidly this thing starts to grow.
Addresses
Now, we have this massive network of computers all connected up to one another, and that's great, but at this point you might be wondering something. Say you were using one of the computers in this network and you wanted to send a message to another, specific computer, perhaps one owned by a friend. How on Earth do you identify her computer among the thousands of other machines? Well, as it turns out, this problem is analogous to the one faced by postal systems around the world, and the internet uses a similar solution.
Just as every house has an address (and possibly a postcode), every computer on the network also has an address, called an Internet Protocol (IP) address, by which it can be identified. In the physical world, if you want to send a letter to someone you drop it into an envelope carrying the address of the recipient, and send it off into the postal system. Similarly, whenever you want to send a message to another computer, you simply add an IP address and send it off into the network.
Domain Names
The above explains how computers find one another on a network, but it leads to another question. If IP addresses are so vital to every interaction that occurs on the internet, how come we, the users, don't see all that much of them? There are a few reasons, but the most basic one is that they are frankly awkward for humans to work with. They currently come in two main forms:
- IPv4 - A collection of four group of numbers, e.g. 192.168.1.100.
- IPv6 - A collection of eight groups of hexadecimal digits, e.g. 02a3:54cb:0002:10ff:edc8:7dd5:16aa:cb86.
Imagine having to type one of those in every time you wanted to visit a website or send an email to someone. This is an example of a common situation, in which computers and humans do not see eye-to-eye on the formats they each like to work with. Computers like IP addresses, but humans would prefer something more readable.
The solution is to let humans work with memorable names like google.co.uk
or facebook.com
. These then get translated into the versions that computers like, IP addresses, behind the scenes. The framework that allows is referred to as the Domain Name System, or DNS for short, and its most common use is to allow humans to access stuff over the internet using these memorable names.
DNS works much like a contacts book (or app). Instead of having to memorise every piece of contact information about everyone you know, you look up their name in your contacts, and from there you get their phone number, address, and whatever else you need. Similarly, around the internet are distributed a number of special computers called DNS servers, that act like contact books for all the other computers on the network. When a computer is given an address in, as far as it is concerned, awkward human form, it goes to the nearest DNS server and asks for the IP address that this human address corresponds to. It can then exchange messages with the computer located at that address.
It is important to remember that DNS is not a compulsory part of the internet, which could function without it. It is an application added to make the network of networks more accessible for human use. This concept of an application, running on top of the internet, as it were, is fundamental to understanding the distinction between the web and the internet itself.
The Web - Running On Top of the Internet
In the description of the internet above, I endeavoured to steer clear of any discussion of websites or the web in general. I did this to avoid adding to the confusion that often leads people to believe the two technologies are in fact one and the same. Part of the reason that this happens is that the web is the internet's biggest success story, and the thing that most people tend to use the internet for. However, the web is in fact just one of many services that the internet is used to provide.
The internet, at its heart, is just a series of connections between computers. It doesn't know anything about webpages, or emails, or videos of cats; it simply carries information back and forth as instructed. Over the years a series of different applications, or 'apps', have been developed to make use of the internet for sending information back and forth. The web is one such application. It uses special groupings of letters and numbers, known as URLs (or Uniform Resource Locators if you like), to let people access files on computers all around the world. In the most common case, a web browser is used to get these files, and display them to the user. The outcome: millions and millions of websites accessible from billions of devices across the world.
Now, if the web is the most widely used application running on the internet, then email is probably next on the list. Email is pretty much exactly what the name says: electronic mail. It is a postal system running on top of the internet, with an email address acting as the equivalent to a postal address, and attachments taking the place of a parcel system. Now, to be clear, email itself is a completely separate application to the web, and uses a different type of communication system to make its way across the internet. An email provider may, at times, allow you to log in to access your email through a website, but the actual transmission of messages occurs separately.
In Summary
The internet is a global series of interconnected networks, allowing the computers that are part of it to communicate with one another. There are, however, many different ways of sending information across this network of networks, and a range of applications that run on top of it to provide services for us. The web is one of these applications, and allows us to view information and media stored on computers across the world in a rich, elegant format. Email is another application, one that provides us with a means of sending messages, and sometimes files, to people around the world, using a single-line address. A discussion of some of the other, less well-known applications is beyond the scope of this article, but I will finish with a list of some common communication methods and their corresponding applications, in case anyone wishes to go further:
- HTTP
- Hypertext Transfer Protocol, the main method of sending information on the web.
- HTTPS
- HTTP over TLS/SSL, the same protocol as before, but 'wrapped', as it were, in another protocol for secure (encrypted) communication.
- IMAP, POP, SMTP
- Internet Message Access Protocol, Post Office Protocol, Simple Mail Transfer Protocol, the common methods of communication used when working with email.
- FTP
- File Transfer Protocol, used to transfer files over the internet.
- SSH
- Secure Shell, a protocol used to log into and control another computer remotely over the network.
- NTP
- Network Time Protocol, used to provide time information and allow computers to set their clocks over the network.
...and many others.