Brodie a Linux YouTuber uploaded a video about a project, I was not aware off: Software Heritage. It is basically for code, what Internet Archive is for websites. If you want watch Brodies video as an introduction: https://inv.nadeko.net/watch?v=MUA9Fu4jNGY or YouTube directly https://youtu.be/MUA9Fu4jNGY


We are building the universal software archive

We collect and preserve software in source code form, because software embodies our technical and scientific knowledge and humanity cannot afford the risk of losing it.

Software is a precious part of our cultural heritage. We curate and make accessible all the software we collect, because only by sharing it we can guarantee its preservation in the very long term.

  • communism@lemmy.ml
    link
    fedilink
    arrow-up
    6
    arrow-down
    1
    ·
    16 hours ago

    That’s cool but is it necessary? If the licence permits redistribution then anyone can just upload to an existing software forge like Codeberg etc

      • communism@lemmy.ml
        link
        fedilink
        arrow-up
        2
        arrow-down
        3
        ·
        13 hours ago

        I don’t think it does address the question. In order to archive source code, you need to have the source code in the first place, ie you can’t archive truly lost source code. If you have the source code, you can upload it to any software forge.

        • thingsiplay@lemmy.mlOP
          link
          fedilink
          arrow-up
          4
          ·
          8 hours ago

          The point is, does it someone? This archive is doing exactly what you say someone could do, copying the software to a place that most likely will survive. They give some examples to what dangers are there, even for open source software. In example, are all Git repositories on Github and other personal repositories backed up on a safe place that will be available to the public at same place? All versions of it?

          Not all code is big and used as often and secured like the Linux code in example. 20 years from now, there will be software, that most individuals and companies will not have anymore on their servers and may not even care. Hardware fails, services disappear and so on. It’s like arguing that anyone can do a website copy to archive it, but does anyone do it? Same thing applies here.

          • communism@lemmy.ml
            link
            fedilink
            arrow-up
            1
            arrow-down
            3
            ·
            2 hours ago

            My point is that you don’t need a separate website for this; you can use existing software forge software and websites.

            • thingsiplay@lemmy.mlOP
              link
              fedilink
              arrow-up
              3
              ·
              2 hours ago

              I can’t archive the entirety of Github, Gitlab and many more services with all source code in all versions and metadata. And make it available to everyone at all times. This is not an effort to archive a few of my personal project, this is an attempt to archive every piece of software that can be archived. Otherwise do you not agree that the Internet Archive has a value archiving all the websites? This is similar, but for software code.

              • communism@lemmy.ml
                link
                fedilink
                arrow-up
                2
                arrow-down
                1
                ·
                1 hour ago

                So it mirrors repos before they go down? I think I get it if that’s the case; I thought it was just a host for “lost” software/source code in which case if you have a copy you can upload it to any software forge (if permitted by the licence). But if it’s meant to contain all software that currently exists, even if it shows no sign of disappearing, that makes more sense.

                • thingsiplay@lemmy.mlOP
                  link
                  fedilink
                  arrow-up
                  1
                  ·
                  51 minutes ago

                  Yes, that’s basically it. It’s a backup, with the intent of being the most comprehensive and secure backup, not controlled by a single company (other than this organization off course). As long as it gets funded by various sources, this should be available in the future. Hopefully.

                  Some additional personal thoughts: This should have better chances to archive than Internet Archive does, as they only archive content that is Open Source (as far as I know). And a reason why big companies fund this is probably they want to use it for Ai… just my speculation on my part…

    • thingsiplay@lemmy.mlOP
      link
      fedilink
      arrow-up
      8
      ·
      16 hours ago

      Sponsored just means its giving money to do the job. Otherwise, Linux and many other open source projects are sponsored by Microsoft and Google.

        • boogiebored@lemmy.world
          link
          fedilink
          arrow-up
          3
          ·
          14 hours ago

          Fair for maybe this specific type of use case, but do you not use anything associated with those corps or similar?

          I am not sure that is possible, but if so, I am all ears.

        • thingsiplay@lemmy.mlOP
          link
          fedilink
          arrow-up
          3
          ·
          15 hours ago

          What exactly is the problem? What would be different if it wasn’t sponsored by Microsoft and Google? It would be sponsored by others and they do the exact same job. I am not saying you have to use it. I just don’t get the point as a reason to avoid. I mean look, maybe if you have good reasoning maybe I will change my mind and avoid it too. But would you avoid Linux too then?