• ItJustDonn@slrpnk.net
    English
    arrow-up
    6
    ·
    1 month ago

    Open source means it can be publicly audited to help soothe suspicion, right? I imagine that would take time, though, if it’s incredibly complex.

    • Alex@lemmy.ml
      English
      arrow-up
      15
      arrow-down
      1
      ·
      1 month ago

      Open source is a very loose term when it comes to GenAI. As with Llama, the weights are available with few restrictions, but, importantly, how it was trained is still secret. Not being reproducible doesn’t seem very open to me.

      • noscere@sh.itjust.works
        English
        arrow-up
        4
        ·
        1 month ago

        True, but in this case I believe they also open sourced the training data and the training process.

        • Alex@lemmy.ml
          English
          arrow-up
          5
          ·
          1 month ago

          Their paper outlines the training process but doesn’t supply the actual data or training code. There is a project on Hugging Face (https://huggingface.co/blog/open-r1) that is attempting a fully open recreation based on what is public.