• FrostyPolicy@suppo.fi
    link
    fedilink
    arrow-up
    35
    arrow-down
    2
    ·
    edit-2
    4 days ago

    Why would you ever want to route your request like this through a third party especially an ad company? To get the favicon of a site you just request www.example.com/favicon.ico.

    • argv minus one@mastodon.sdf.org
      link
      fedilink
      arrow-up
      12
      ·
      4 days ago

      @FrostyPolicy

      Not necessarily. You need to fetch the HTML of the web page you want the icon for and see if there’s a <link rel=icon> or equivalent HTTP header.

      And yes, this means different pages on the same site can have different icons.

      @Pro

      • who@feddit.org
        link
        fedilink
        English
        arrow-up
        3
        ·
        edit-2
        3 days ago

        So you might need to try different file extensions.

        No, you don’t. The HTTP response header will tell you what type it is. Anything using file name suffixes to determine content type on the web (unless it’s just a fallback guess) is broken.

        • bleistift2@sopuli.xyz
          link
          fedilink
          English
          arrow-up
          2
          ·
          edit-2
          2 days ago

          So you might need to try different file extensions.

          No, you don’t. The HTTP response header will tell you what type it is.

          I’m not sure what you’re talking about. You need to know the name of the file before you can download it.

          For instance, my lemmy client, says in its HTML that its favicon is <link id="favicon" rel="shortcut icon" type="image/x-icon" href="https://sopuli.xyz/pictrs/image/9c6eeb58-bf66-4a15-9537-0a822f3c4feb.png">. If I were to blindly download /favicon.ico, I’d naturally get a 404 page:

          $ curl https://sopuli.xyz/favicon.ico -i
          HTTP/2 404 
          # more data
          
          • who@feddit.org
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            1
            ·
            edit-2
            2 days ago

            You need to know the name of the file before you can download it.

            Indeed. And /favicon.ico might not be a Windows ICO file, but instead be a PNG, GIF, or some other image format. I thought that’s what you were (correctly) pointing out when you wrote “it doesn’t have to be an *.ico file”. In such cases, the HTTP Content-Type field tells what image format it is.

            If I were to blindly download /favicon.ico, I’d naturally get a 404 page:

            Ah, so it turns out you were thinking of cases where /favicon.ico doesn’t exist at all. That can also happen, but your suggestion to “try different file extensions” is not the answer, as you can see if you try to curl /favicon.png, /favicon.gif, etc. The correct approach is to parse a web page’s HTML in search of a favicon URL, which you did in your above reply, but that’s not the same thing as what you originally suggested.

      • FrostyPolicy@suppo.fi
        link
        fedilink
        arrow-up
        3
        ·
        4 days ago

        True. I’ve just seen requests to that file in my demo env for web apps I have and I’ve seen my browser request that file by itself when running those locally.

      • kevincox@lemmy.ml
        link
        fedilink
        arrow-up
        2
        ·
        edit-2
        3 days ago

        /favicon.ico is the only “default” URL. /favicon.ico is usually not an actual “icon” type anymore but PNG or JPG (but with the same URL). Other than that you need to load the HTML and check for Link headers or <link rel=icon> elements. While URLs like /favicon.png may be popular they aren’t part of any actual protocol.