The (sorry) state of the www

The web today isn't the way more content-centric and naive-but-simple web we used to have in the 90s and early 2000s, but has developed a much stronger focus on tracking and trying to influence what people think instead of being a source of more or less freely and actively shared information. Additionally, a lot of sites seem to care today more about presentation than content, resulting in many shiny, but bloated and less useful sites.

The bloat is comical sometimes; even heavily used sites that would benefit directly from leaner approaches somehow find it normal to use insane amounts of resources, leading to pointlessly increased traffic, CPU load, energy consumption and ultimately carbon footprint. e.g. a standard slack web chat session with no more than 50 users easily uses up 250-500MB of RAM (in addition to the already heavy browser requirements). Loading slack on my laptop takes longer than booting the OS. I find it hard to justify the need for this kind of bloat for a chat. We talk about exchanging a few bytes of text between people in real time, with optional message history, something we do for 30+ years on way less beefy machines.

Also, more and more gets centralized, distributed and managed by a few big players in the field, who additionally make it hard for independent services to exist. This same centralization happens for other common, non-web services: for example, running your own mailserver today is more often than not a constant battle to not be blacklisted as a spam source, because many relays just consider @gmail.com and other big ones as the only trustworthy sender domains (which is ironic at best), no matter how clean your record is, how careful you set up everything for spam heuristics (like PTR records, DKIM, etc.).

Shining through in all paragraphs above is a feeling that seems to underlie this all, namely it all haven gotten more and more about power. In some areas it surely always was somewhere about power, but from the end user perspective it is so much more invasive now, and the content so much more shallow by average, and the positive enthousiasm over this big pool of information of the early web is now either gone or comes with a bitter after-taste.

Not to mention that there are so many things with the web that are broken to begin with, its tech's ever growing complexity will always be error prone, leading to all kinds of vulnerabilities, breaches and scams. Its one way links will always break eventually leading to frustration, and not to mention all the resources and energy wasted by all the useless bloat of the nowadays rapid-development web frameworks and often intentional content-hiding style.

I don't seem to be the only one that is worried about this development so here's a collection of links looking at this from plenty of different angles:

The Web is still a DARPA weapon. - click-baity title, but interesting reflection of power use and influence on the web
Why is Gopher Still Relevant? - overview of gopher in contrast to the web, which experiences some renewed interest b/c of it's structured, minimalist content-centricness
Project Xanadu - Ted Nelson is still around and his vision feels IMHO fresher than the current web
What We Have Now Is Not Advertising - reflections on targeted advertising
Save RSS and Atom! - arguments for assuring content-centricness, choice and privacy for consumption of web-content (it's sad that RSS/Atom were necessary to begin with)
The Bullshit Web - reflections on web-bloat in interesting depth
The Death of Transit and Beyond - super interesting presentation about content providers longing for even more power and getting into controlling more and more of the worldwide transit, with very spot on historical comparison to the "Gilded Age"
Bloated - more reflections on bloat
Software disenchantment - even more about bloat, not only about the web, nicely put into perspective
Freeing the Web from the Browser - ideas on better/less-broken hypermedia designs, also an intro to a comprehensive dissertation from the same author
Solid (web decentralization project) - the father of the web himself seems to see a need for change to go back to more decentralization and privacy
IndieWeb - calls for a less corporate centric web
How to Build a Low-tech Website? - call for less bloat with interesting example (also good suggestions and plenty more links in the comments)
BCHS stack - fast, secure and low-level web stack: less bloat, less complexity, less error prone black/hidden magic (see example)
Lo and Behold, Reveries of the Connected World - documentary by Werner Herzog, with Ted Nelson
weboob.org - Web outside of browsers/ - project from people that want features and not the presentation overhead of websites
CloudFlare, We Have A Problem - thoughts on Cloudflare and its centralization (also as MitM actor), it encouraging bad practices as a business model, ...
CloudFlare is ruining the internet (for me) - CF rant from a cultural perspective
The Website Obesity Crisis - slides of a talk about bloat
Project Gemini - Gopher inspired client-server protocol for hypertext, with simplicity as goal

Another worrisome thing going on, IMHO, is the push by many of the big players for DNS-over-HTTPS (DOH). They seem to propagate the claim that this is needed because DNS is insecure. Although a valid point at the time of writing, they ignore the fact that the latter is addressed by things like dnscrypt with DNSSEC. Besides the fact that pushing operating system level services like a name resolver into an application (e.g. a browser, where you would for example set the DoH resolver in Firefox via config option network.trr.uri) is insane to begin with, they seem to have strong reasons to push for DoH, probably motivated again by power and tracking reasons. Why wouldn't they be interested in overriding the globally by default decentralized ISP provided resolvers (which most peoples' system would get via DHCP if not overridden), and instead getting a grip on the majority of all DNS lookups by sending them by default to one or a few big players? Data is money after all. I don't want to even think about the massive single point of failure such a centralization would also be.

There are claims that this would somehow free you from those apparently evil ISPs that censor your requests. Sure, this might happen (as is the case with kinda every coffee shop or hotel uplink), but they don't mention how tasty it would be for them to resolve things for you that bypass your OS-level resolver, where you could setup per-domain blacklists to block trackers, adservers, malicious stuff, etc..

Wouldn't it be too nice for companies like Cloudflare to receive in a centralized way a major number of name lookups, by being the default resolver in one or more of the already few major browsers? Would you trust this US company, that already encourages people to give them their certificates to play man-in-the-middle, to know more about your browsing habits? Would you trust Google with their Chrome browser having even more insight into your live (and control over what you can access), but defaulting Chrome to use their resolvers? What a coincidence that that's a company that makes money by tracking you for targeted advertising... not. A quote that reflects my worries from the comments under this DoH intro:

Brett Glass: So, Mozilla intends to hack users' DNS, redirecting their queries away from their ISPs (which are trustworthy and with which they have a business relationship) to an untrustworthy VPN vendor - Cloudflare. Those users are not Cloudflare's customers, and so the only way Cloudflare can monetize this service is to spy on users and sell their personal information. In short, Mozilla is supporting, aiding, and abetting privacy invasion - probably in exchange for money from Cloudflare. Not only unethical but probably actionable by the FTC.

Some links specifically about DoH and the debate around it:

The inmates have taken over the asylum - co-architect of the DNS, Paul Vixie, having a strong opinion about this development; the comments are also interesting
Pros and Cons of DNS Over HTTPS - trying to reflect on the pros and cons, but IMHO falling short in many areas
Firefox, DNS over HTTPS and a controversial Shield Study - interesting article about mozilla's DoH study in Firefox with critical voices from within mozilla
Opinion: Clarifying what DoH means for privacy - APNIC's Geoff Huston and his view on DoH, based on his vast and long experience

To me this feels like it's all in line with the "Death of Transit" presentation linked to above in the first bullet point list: another worrisome development mainly supported by some internet megacorps longing for even more centralization and control.

As a closing line to avoid misinterpretation: my intro text as well as the articles in the first bullet point list are about the web as a content source, and not about the web's reinvention of the thin-client/mainframe model, which is what webapps basically are.

UPDATE (2019-06-03): Added a detailed and interesting opinion article to the DoH article list
UPDATE (2021-03-17): Added some more links

2019-03-14 by 0xpebbles Permalink

web gopher ad tracking surveillance bloat content