- For the world's first web browser, see WorldWideWeb.
Graphic representation of the World Wide Web around Wikipedia
The World Wide Web ("WWW", "W3", or simply "Web") is an information space in which the items of interest, referred to as resources, are identified by global identifiers called Uniform Resource Identifiers (URIs). The term is often mistakenly used as a synonym for the Internet, but the Web is actually a service that operates over the Internet.
Basic terms
Hypertext is viewed using a program called a web browser which retrieves pieces of information, called "documents" or "web pages", from web servers and displays them, typically on a computer monitor. One can then follow hyperlinks on each page to other documents or even send information back to the server to interact with it. The act of following hyperlinks is often called "surfing" or "browsing" the Web. Web pages are often arranged in collections of related material called "websites."
The phrase "surfing the Internet" was first popularized in print by Jean Armour Polly, a librarian, in an article called Surfing the INTERNET, published in the Wilson Library Bulletin in June, 1992. Although Polly developed the phrase independently, slightly earlier uses of similar terms have been found on the Usenet from 1991 and 1992, and some recollections claim it was also used verbally in the hacker community for a couple years before that. Polly is famous as "NetMom" in the world of Internet.
For more information on the distinction between the World Wide Web and the Internet itself — as in everyday use the two are sometimes confused — see Dark internet where this is discussed in more detail.
Although the English word worldwide is normally written as one word (without a space or hyphen), the proper name World Wide Web and abbreviation WWW are now well-established even in formal English. The earliest references to the Web called it the WorldWideWeb (an example of computer programmers' fondness for intercaps) or the World-Wide Web (with a hyphen, this version of the name is the closest to normal English usage).
How the web works
When you want to access a web page, or other "resource", on the World Wide Web, you normally begin either by typing the URL of the page into your browser, or by following a hypertext link to that page or resource. The first step, behind the scenes, is for the server-name part of the URL to be resolved into an IP address by the global, distributed Internet database known as the Domain name system or DNS.
The next step is for an HTTP request to be sent to the web server working at that IP address for the page required. In the case of a typical web page, the HTML text, graphics and any other files that form a part of the page will be requested and returned to the client in quick succession.
The web browser's job is then to render the page as described by the HTML, CSS and other files received, incorporating the images, links and other resources as necessary. This produces the on-screen 'page' that you see.
Most web pages will, themselves, contain hyperlinks to other relevant and informative pages and perhaps to downloads, source documents, definitions and other web resources.
Such a collection of useful, related resources, interconnected via hypertext links, is what has been dubbed a 'web' of information. Making it available on the Internet produced what Tim Berners-Lee first called the World Wide Web in the early 1990s [1] [2].
Origins
See also: History of the Internet
This NeXTcube used by Berners-Lee at CERN became the first Web server.
The Web can be traced back to a project at the European Organization for Nuclear Research (CERN) in 1989 when Tim Berners-Lee and Robert Cailliau built ENQUIRE (short for Enquire Within Upon Everything, a book Berners-Lee recalled from his youth).
While it was rather different from the Web we use today, it contained many of the same core ideas (and even some of the ideas of Berners-Lee's next project, the Semantic Web). Berners-Lee mentions that much of the motivation behind the project was so that he could access library information that was scattered on several different servers at CERN.
Tim Berners-Lee published a more formal proposal for the actual World Wide Web on November 12, 1990 [3]. Implementation began on November 13, 1990 when Berners-Lee wrote the first Web page [4] on a NeXT workstation.
During the Christmas holiday of that year, Berners-Lee built all the tools necessary for a working Web [5]: the first Web browser (which was a Web editor as well) and the first Web server.
On August 6, 1991, he posted a short summary of the World Wide Web project on the alt.hypertext newsgroup. This date also marked the debut of the Web as a publicly available service on the Internet.
The crucial underlying concept of hypertext originated with older projects from the 1960s, such as Ted Nelson's Project Xanadu and Douglas Engelbart's oN-Line System (NLS). Both Nelson and Engelbart were in turn inspired by Vannevar Bush's microfilm-based "memex," which was described in the 1945 essay "As We May Think".
Berners-Lee's brilliant breakthrough was to marry hypertext to the Internet. In his book Weaving The Web, he explains that he had repeatedly suggested that a marriage between the two technologies was possible to members of both technical communities, but when no one took up his invitation, he finally tackled the project himself. In the process, he developed a system of globally unique identifiers for resources on the Web and elsewhere: the Uniform Resource Identifier.
The World Wide Web had a number of differences from other hypertext systems that were then available.
- The WWW required only unidirectional links rather than bidirectional ones. This made it possible for someone to link to another resource without action by the owner of that resource. It also significantly reduced the difficulty of implementing Web servers and browsers (in comparison to earlier systems), but in turn presented the chronic problem of broken links.
- Unlike certain applications such as HyperCard or Gopher, the World Wide Web was non-proprietary, making it possible to develop servers and clients independently and to add extensions without licensing restrictions.
On April 30, 1993, CERN announced that the World Wide Web would be free to anyone, with no fees due.
Web standards
At its core, the Web is made up of three standards:
- the Uniform Resource Identifier (URI), which is a universal system for referencing resources on the Web, such as Web pages;
- the HyperText Transfer Protocol (HTTP), which specifies how the browser and server communicate with each other; and
- the HyperText Markup Language (HTML), used to define the structure and content of hypertext documents.
Berners-Lee now heads the World Wide Web Consortium (W3C), which develops and maintains these and other standards that enable computers on the Web to effectively store and communicate different forms of information.
Beyond text
Screenshot of the very first WWW browser running on a NeXT system
The initial WorldWideWeb program at CERN displayed styled text and images, and it was a WYSIWYG HTML editor as well as the browser. Tim Berners-Lee originally made no distinction between surfing (reading) and editing web documents.
As it ran only on NeXT machines, CERN released a simple, text-only version to the world. Some journalists first encountered the Web through the text browser written by Nicola Pellow and this engendered a myth that the Web was text-only until Mosaic came along. The Web had graphics from the start, at least for the minority of NeXT users.
Meanwhile, browsers such as Tony Johnson's "Midas" and Pei-Yuan Wei's Viola (1991) added the ability to display graphics also on other Unix machines. In 1993, Marc Andreessen of NCSA released their browser, called "Mosaic for X", that sparked a tremendous rise in the popularity of the Web among novice users. Andreessen went on to found Mosaic Communications Corporation (now Netscape Communications Corporation, a unit of Time Warner). Additional features such as dynamic content, music and animation can be found in modern browsers.
Browser makers do not always adhere to the standards set forth by the W3C, so it is not uncommon for these newer features not to work properly on all browsers. The ever-improving technical capability of the WWW has enabled the development of real-time web-based services such as webcasts, Internet radio and live web cams.
Java and JavaScript
Another significant advance in the technology was Sun Microsystems' Java programming language, which initially enabled web servers to embed small programs (called applets) directly into the information being served that would run on the user's computer, allowing faster and richer user interaction, but came to be more widely used as a tool for generating complex server-side content as it is requested.
JavaScript, however, is a scripting language that was developed for Web pages (the standardized version is ECMAScript). While its name is similar to Java it was developed by Netscape and not Sun Microsystems (and has almost nothing to do with it). In conjunction with the Document Object Model, JavaScript has become a much more powerful language than its creators originally envisioned. Sometimes its usage is expressed under the term Dynamic HTML (DHTML), to emphasize a shift away from static HTML pages.
Sociological implications
The exponential growth of the Internet was primarily attributed to the emergence of the web browser Mosaic, followed by its commercial offspring Netscape Navigator, during the mid-1990s.
It brought unprecedented attention to the Internet from media, industries, policy makers, and the general public.
Eventually, it led to several visions of how modern societies might change into information societies, although some point out that those visions are not unique to the Internet, but repeated with many new technologies (especially information and communications technologies) of various eras.
Because the Web is global in scale, some suggested that it will nurture mutual understanding on a global scale.
Publishing web pages
The Web is available to individuals outside mass media. In order to "publish" a web page, one does not have to go through a publisher or other media institution, and potential readers could be found in all corners of the globe.
Unlike books and documents, hypertext does not have a linear order from beginning to end. It is not broken down into the hierarchy of chapters, sections, subsections, etc.
Many different kinds of information are now available on the Web, and for those who wish to know other societies, their cultures and peoples, it has become easier. When travelling in a foreign country or a remote town, one might be able to find some information about the place on the web, especially if the place is in one of the developed countries. Local newspapers, government publications, and other materials are easier to access, and therefore the variety of information obtainable with the same effort may be said to have increased, for the users of the Internet.
Although some websites are available in multiple languages, many are in the local language only. Also, not all software supports all special characters, and RTL languages. These factors would challenge the notion that the World Wide Web will bring a unity to the world.
The increased opportunity to publish materials is certainly observable in the countless personal pages, as well as pages by families, small shops, etc., facilitated by the emergence of free web hosting services.
Statistics
According to a 2001 study [6], there were more than 550 billion documents on the Web, mostly in the "invisible web". A 2002 survey of 2,024 million web pages [7] determined that by far the most Web content was in English: 56.4%; next were pages in German (7.7%), French (5.6%) and Japanese (4.9%). There seem to be an increase in Chinese sites since however. A more recent study [8] which used web searches in 75 different languages to sample the web determined that there were over 11.5 billion web pages in the publically-indexable web as of January 2005.
Speed issues
Frustration over congestion issues in the Internet infrastructure and the high latency that results in slow browsing has lead to an alternative name for the Web: the World Wide Wait. Speeding up the Internet is an ongoing discussion over the use of peering and QoS technologies. Other solutions to reduce the World Wide Wait can be found on W3C.
Academic conferences
The major academic event covering the WWW is the World Wide Web series of conferences, promoted by IW3C2. There is a list with links to all conferences in the series.
Pronunciation of "www"
Most English-speaking people pronounce the 9-syllable letter sequence www used in some domain names for websites as "double U, double U, double U" despite shorter options like "triple double U" being available.
Some languages do not have the letter w in their alphabet (for example, Italian), which leads some people to pronounce www as "vou, vou, vou." In some languages (such as Czech and Finnish) the w is substituted by a v, so Czechs pronounce www as "veh, veh, veh" rather than the correct but much longer pronunciation "dvojité veh, dvojité veh, dvojité veh;" the same applies to Finnish, where the correct pronounciation would be "kaksoisvee, kaksoisvee, kaksoisvee." Several other languages (e.g. German, Dutch etc.) simply pronounce the letter W as a single syllable, so this problem doesn't occur.
Depending on how the domain and web server are set up, a www website can often be accessed without entering the "www.", as long as the ".com" or other appropriate top-level domain is appended. Even this is not always necessary as some browsers will automatically try adding "www." and ".com" to typed URIs if a web page isn't found without them.
In English pronunciation, saying the full words "World Wide Web" takes one-third as many syllables as saying the initialism "www". According to Berners-Lee, others mentioned this fact as a reason to choose a different name, but he persisted.
See also
References
- Fielding, R.; Gettys, J.; Mogul, J.; Frystyk, H.; Masinter, L.; Leach, P.; Berners-Lee, T. (June 1999). "Hypertext Transfer Protocol - HTTP/1.1." Request For Comments 2616. Information Sciences Institute.
- Berners-Lee, Tim; Bray, Tim; Connolly, Dan; Cotton, Paul; Fielding, Roy; Jeckle, Mario; Lilley, Chris; Mendelsohn, Noah; Orchard, David; Walsh, Norman; Williams, Stuart (December 15, 2004). "Architecture of the World Wide Web, Volume One." Version 20041215. W3C.
- Polo, Luciano (2003). "World Wide Web Technology Architecture: A Conceptual Analysis." New Devices. . Accessed on July 31, 2005.
External links
Standards
The following is a cursory list of the documents that define the World Wide Web's three core standards:
- Uniform Resource Locator (URL)
- Hypertext Markup Language (HTML)
- HyperText Transfer Protocol (HTTP)
|