Did you know that Google provides a free API to retrieve the favicon for any website?

Pro@programming.dev · 4 days ago

Did you know that Google provides a free API to retrieve the favicon for any website?

FrostyPolicy@suppo.fi · edit-2 4 days ago

Why would you ever want to route your request like this through a third party especially an ad company? To get the favicon of a site you just request www.example.com/favicon.ico.

argv minus one@mastodon.sdf.org · 4 days ago

@FrostyPolicy

Not necessarily. You need to fetch the HTML of the web page you want the icon for and see if there’s a <link rel=icon> or equivalent HTTP header.

And yes, this means different pages on the same site can have different icons.

@Pro

m0xEE@nosh0b10.m0xee.net · 4 days ago

@argv_minus_one@mastodon.sdf.org @FrostyPolicy@suppo.fi @Pro@programming.dev
AFAIK there’s also a special file name Safari uses for bigger PNG icons. That Google thing probably just facilitates all this machinery, but I agree — I wouldn’t rely on Google for such things myself.

bleistift2@sopuli.xyz · 4 days ago

It doesn’t have to be an *.ico file. So you might need to try different file extensions.

who@feddit.org · edit-2 3 days ago

So you might need to try different file extensions.

No, you don’t. The HTTP response header will tell you what type it is. Anything using file name suffixes to determine content type on the web (unless it’s just a fallback guess) is broken.

bleistift2@sopuli.xyz · edit-2 2 days ago

So you might need to try different file extensions.

No, you don’t. The HTTP response header will tell you what type it is.

I’m not sure what you’re talking about. You need to know the name of the file before you can download it.

For instance, my lemmy client, says in its HTML that its favicon is <link id="favicon" rel="shortcut icon" type="image/x-icon" href="https://sopuli.xyz/pictrs/image/9c6eeb58-bf66-4a15-9537-0a822f3c4feb.png">. If I were to blindly download /favicon.ico, I’d naturally get a 404 page:

$ curl https://sopuli.xyz/favicon.ico -i
HTTP/2 404 
# more data

who@feddit.org · edit-2 2 days ago

You need to know the name of the file before you can download it.

Indeed. And /favicon.ico might not be a Windows ICO file, but instead be a PNG, GIF, or some other image format. I thought that’s what you were (correctly) pointing out when you wrote “it doesn’t have to be an *.ico file”. In such cases, the HTTP Content-Type field tells what image format it is.

If I were to blindly download /favicon.ico, I’d naturally get a 404 page:

Ah, so it turns out you were thinking of cases where /favicon.ico doesn’t exist at all. That can also happen, but your suggestion to “try different file extensions” is not the answer, as you can see if you try to curl /favicon.png, /favicon.gif, etc. The correct approach is to parse a web page’s HTML in search of a favicon URL, which you did in your above reply, but that’s not the same thing as what you originally suggested.

FrostyPolicy@suppo.fi · 4 days ago

True. I’ve just seen requests to that file in my demo env for web apps I have and I’ve seen my browser request that file by itself when running those locally.

kevincox@lemmy.ml · edit-2 3 days ago

/favicon.ico is the only “default” URL. /favicon.ico is usually not an actual “icon” type anymore but PNG or JPG (but with the same URL). Other than that you need to load the HTML and check for Link headers or <link rel=icon> elements. While URLs like /favicon.png may be popular they aren’t part of any actual protocol.