This is an interesting demo, but it has some drawbacks I can already see:

  • It’s Windows only (maybe Win11 only, the documentation isn’t clear)
  • It only works with RTX 30 series and up
  • It’s closed source, so you have no idea if they’re uploading your data somewhere

The concept is great, having an LLM to sort through your local files and help you find stuff, but it seems really limited.

I think you could get the same functionality (and more) by writing an API for text-gen-webui.
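To make that a bit more concrete, here is a rough sketch of the "find stuff in my local files" half. It is only an illustration: the function names (`tokenize`, `rank_files`, `build_prompt`), the `.txt`-only filter, and the simple word-overlap scoring are all my own assumptions, not anything taken from Chat with RTX or text-gen-webui. The resulting prompt would then be sent to whatever locally hosted model you run; that step is left out because the endpoint depends on your setup.

```python
import re
from pathlib import Path


def tokenize(text):
    """Lowercase a string and split it into a set of word tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def rank_files(folder, question, top_k=3):
    """Return up to top_k (path, text) pairs that share the most words with the question."""
    query = tokenize(question)
    scored = []
    for path in sorted(Path(folder).rglob("*.txt")):
        text = path.read_text(encoding="utf-8", errors="ignore")
        score = len(query & tokenize(text))
        if score > 0:
            scored.append((score, str(path), text))
    # Highest overlap first; ties are broken by path so the order is stable.
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [(path, text) for _, path, text in scored[:top_k]]


def build_prompt(question, hits, max_chars=1000):
    """Assemble a prompt that quotes the retrieved files ahead of the question."""
    parts = ["Answer using only the files below.\n"]
    for path, text in hits:
        # Truncate each file so the prompt stays within a small context budget.
        parts.append(f"--- {path} ---\n{text[:max_chars]}\n")
    parts.append(f"Question: {question}")
    return "\n".join(parts)
```

A real version would probably swap the word-overlap scoring for embeddings, but the overall shape (retrieve a few files, paste them into the prompt, ask the model) stays the same.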

More info here: https://videocardz.com/newz/nvidia-unveils-chat-with-rtx-ai-chatbot-powered-locally-by-geforce-rtx-30-40-gpus

  • General_Effort@lemmy.world
    9 months ago

    It means that they want people to consult the code as a reference for how to best use the hardware acceleration.

    If all software uses their cards to best effect, that makes their cards more useful and thus more valuable, which makes them money. If only their own frontend can do that, they lose out on most of that benefit, while also having to spend money to make sure that the rest of the software, like the UI, is competitive.

    • Donut
      9 months ago

      Ah, that makes sense. Thank you!