RSS Server Side Reader


I like the idea of RSS, but none of the RSS readers stuck with me, until I implemented one of my own, using a somewhat unusual technique. There’s at least one other person using this approach now, so let’s write this down.

About Feed Readers

As I’ve mentioned, while I like the ideas behind RSS, none of the existing RSS readers worked for me: they try to do more than I need. A classical RSS reader fetches the full content of articles, saves it for offline reading, and renders it using an embedded web browser. I don’t need any of this. I prefer reading articles on the author’s website, in my normal browser (and, occasionally, its reader mode). The only thing I need is notifications.

What Didn’t Work: Client Side Reader

My first attempt at my own RSS reader was a web page that stored its state in the browser’s local storage. This idea was foiled by CORS: when client-side JavaScript does a fetch, it can only fetch resources from the origin the page itself is hosted on, unless the other server explicitly opts in with CORS headers. But feeds are hosted on other domains, which typically don’t opt in.
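For illustration, the client-side version boiled down to something like this sketch (the feed URL is made up):

// Runs in the browser. Unless feed.example.com opts in with an
// Access-Control-Allow-Origin header, the browser blocks this
// cross-origin request, and fetch rejects with a TypeError.
async function fetch_feed(url: string): Promise<string> {
  const response = await fetch(url);
  return await response.text();
}

fetch_feed("https://feed.example.com/atom.xml")
  .then((xml) => console.log(xml.length))
  .catch((error) => console.error(error));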

What Did Work: SSR

I have a blog. You are reading it. I now build my personalized feed as a part of this blog’s build process. It is hosted at https://matklad.github.io/blogroll.html

This list is stateless: for each feed I follow, I display the latest three posts, newer posts on top. I don’t maintain read/unread state. If I don’t remember whether I read the article or not, I might as well re-read! I can access this list from any device.

While it is primarily for me, the list is publicly available, and might be interesting for some readers of my blog. Hopefully, it also helps to page-rank the blogs I follow!

The source of truth is blogroll.txt, a simple list of links, one per line. Originally, I tried using OPML, but it is far too complicated for what I need here, and is actively inconvenient to modify by hand.
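For illustration, the file looks something like this (made-up URLs):

https://example.com/feed.xml
https://blog.example.org/atom.xml
https://another.example.net/rss.xml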

Here’s the entire code to fetch the blogroll, using a third-party feed-parsing library (imported as @rss below):

import { parseFeed } from "@rss";

export interface FeedEntry {
  title: string;
  url: string;
  date: Date;
}

export async function blogroll(): Promise<FeedEntry[]> {
  // One feed URL per line; skip blank lines.
  const urls = (await Deno.readTextFile("content/blogroll.txt"))
    .split("\n").filter((line) => line.trim().length > 0);
  const all_entries = (await Promise.all(urls.map(blogroll_feed))).flat();
  // Newest posts first.
  all_entries.sort((a, b) => b.date.getTime() - a.date.getTime());
  return all_entries;
}

async function blogroll_feed(url: string): Promise<FeedEntry[]> {
  let feed;
  try {
    const response = await fetch(url);
    const xml = await response.text();
    feed = await parseFeed(xml);
  } catch (error) {
    // A single broken feed shouldn't fail the whole build.
    console.error({ url, error });
    return [];
  }
  return feed.entries.map((entry: any) => {
    return {
      title: entry.title!.value!,
      // Prefer the HTML link; fall back to the entry's first link.
      url: (entry.links.find((it: any) =>
        it.type == "text/html" || it.href!.endsWith(".html")
      ) ?? entry.links[0])!.href!,
      date: (entry.published ?? entry.updated)!,
    };
  }).slice(0, 3); // Keep only the three most recent posts per feed.
}
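A hypothetical call site, just to show the shape of the result (the import path is made up; this snippet is not part of the actual build):

import { blogroll } from "./blogroll.tsx";

const posts = await blogroll();
for (const post of posts.slice(0, 5)) {
  console.log(post.date.toISOString(), post.title, post.url);
}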

And this is how the data is converted to HTML during the build process:

export function BlogRoll({ posts }: { posts: FeedEntry[] }) {
  function domain(url: string): string {
    return new URL(url).host;
  }
  // Each post becomes a list item: date and source domain, then the title.
  const list_items = posts.map((post) => (
    <li>
      <h2>
        <span class="meta">
          <Time date={post.date} />, {domain(post.url)}
        </span>
        <a href={post.url}>{post.title}</a>
      </h2>
    </li>
  ));
  return (
    <Base>
      <ul class="post-list">
        {list_items}
      </ul>
    </Base>
  );
}
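Time and Base come from the rest of the blog’s build code and aren’t shown here. A minimal Time could look like this (my sketch, not the blog’s actual implementation):

function Time({ date }: { date: Date }) {
  // Machine-readable value in the datetime attribute,
  // human-readable "YYYY-MM-DD" as the visible text.
  const iso = date.toISOString().slice(0, 10);
  return <time datetime={iso}>{iso}</time>;
}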

A GitHub Actions workflow rebuilds the blogroll every midnight:

name: CI
on:
  push:
    branches:
      - master
  schedule:
    - cron: "0 0 * * *"
jobs:
  CI:
    runs-on: ubuntu-latest
    permissions:
      pages: write
      id-token: write
    steps:
      - uses: actions/checkout@v4
      - uses: denoland/setup-deno@v2
        with:
          deno-version: v2.x
      - run: deno task build --blogroll
      - uses: actions/upload-pages-artifact@v3
        with:
          path: ./out/www
      - uses: actions/deploy-pages@v4
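The deno task build --blogroll step refers to a task defined in the repository’s deno.json; I don’t know its exact definition, but it would look roughly like this (the entry point name is a guess):

{
  "tasks": {
    "build": "deno run --allow-read --allow-write --allow-net main.tsx"
  }
}

Arguments after the task name (here, --blogroll) are appended to the task’s command, so the build script can check Deno.args and only hit the network when the flag is present, keeping routine local builds fast and offline.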

Links

Tangentially related: another pattern is to maintain a list of all-time favorite links:

https://matklad.github.io/links.html
