Leminal Space
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
Davriellelouna@lemmy.world to Technology@lemmy.worldEnglish · 1 个月前

OpenAI beats Elon Musk's Grok in AI chess tournament

www.bbc.com

external-link
message-square
19
link
fedilink
58
external-link

OpenAI beats Elon Musk's Grok in AI chess tournament

www.bbc.com

Davriellelouna@lemmy.world to Technology@lemmy.worldEnglish · 1 个月前
message-square
19
link
fedilink
The tournament saw models from Anthropic, Google, xAI and DeepSeek compete against each other to be crowned the top AI chess player.
alert-triangle
You must log in or # to comment.
  • acosmichippo@lemmy.world
    link
    fedilink
    English
    arrow-up
    75
    ·
    1 个月前

    Grok was thrown off by being assigned the black pieces for the match.

  • Asafum@feddit.nl
    link
    fedilink
    English
    arrow-up
    60
    ·
    1 个月前

    “Grok then generated an image of a chess board being flipped over and complained “I only lost because the JEWS own chess!” Elon Musk could not be reached for comment as he’s currently lost in a K hole.”

    • panda_abyss@lemmy.ca
      link
      fedilink
      English
      arrow-up
      21
      ·
      1 个月前

      I can’t tell if this is satire or

    • HubertManne@piefed.social
      link
      fedilink
      English
      arrow-up
      6
      ·
      1 个月前

      I came to say something about it flipping over the table.

  • AbouBenAdhem@lemmy.world
    link
    fedilink
    English
    arrow-up
    34
    ·
    1 个月前

    “Up until the semi finals, it seemed like nothing would be able to stop Grok 4 on its way to winning the event,” Pedro Pinhata, a writer for Chess.com, said in its coverage. “Despite a few moments of weakness, X’s AI seemed to be by far the strongest chess player… But the illusion fell through on the last day of the tournament.” He said Grok’s “unrecognizable” and “blundering” play enabled o3 to claim a succession of “convincing wins”.

    I think the main takeaway is that these models are fundamentally inconsistent, and you can never assume they’re going to be reliable based on past performance.

    • bigfondue@lemmy.world
      link
      fedilink
      English
      arrow-up
      25
      ·
      1 个月前

      And they’d both get destroyed by StockFish

      • Skullgrid@lemmy.world
        link
        fedilink
        English
        arrow-up
        15
        ·
        1 个月前

        No idea what the point of this tournament was.

        • snooggums@lemmy.world
          link
          fedilink
          English
          arrow-up
          11
          ·
          1 个月前

          Getting attention.

        • palordrolap@fedia.io
          link
          fedilink
          arrow-up
          8
          ·
          1 个月前

          D*ck measuring contest.

        • hanabatake@lemmy.ml
          link
          fedilink
          English
          arrow-up
          4
          ·
          1 个月前

          Fun, IA helps human players explore new ideas, games allow researchers to observe their IA interactions in other settings …

        • kometes@lemmy.world
          link
          fedilink
          English
          arrow-up
          4
          ·
          30 天前

          Special Olympics

    • acosmichippo@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      1 个月前

      or they are matchup dependent based on the strategies they were trained on.

  • latenightnoir@lemmy.blahaj.zone
    link
    fedilink
    English
    arrow-up
    15
    ·
    edit-2
    1 个月前

    Meh… Robot Wars is better…

  • Repple (she/her)@lemmy.world
    link
    fedilink
    English
    arrow-up
    11
    ·
    1 个月前

    I haven’t tried in a while, but shortly after gpt4 came out I tried to play chess against it. It just completely changed the board position nearly every move making illegal moves, adding pieces etc. do current models keep track of the board and make legal moves without special prompting to help? Were these assisted by agentic tools handling state?

    • acosmichippo@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      1 个月前

      deleted by creator

  • RagingSnarkasm@lemmy.world
    link
    fedilink
    English
    arrow-up
    6
    ·
    30 天前

    “I got winner.”

    –Atari 2600, probably

  • SugarCatDestroyer@lemmy.world
    link
    fedilink
    English
    arrow-up
    5
    ·
    edit-2
    1 个月前

    What useful information… It helped me so much in real life and to hell with it all lol.

  • IcyToes@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    2
    ·
    27 天前

    AI dick measuring contest?

  • MrSulu@lemmy.ml
    link
    fedilink
    English
    arrow-up
    2
    ·
    30 天前

    In a formal response from Musk he said nothing meaningful.

  • m3t00@piefed.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    30 天前

    king me

Technology@lemmy.world

technology@lemmy.world

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@lemmy.world

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


  • @L4s@lemmy.world
  • @autotldr@lemmings.world
  • @PipedLinkBot@feddit.rocks
  • @wikibot@lemmy.world
Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 4.14K users / day
  • 8.51K users / week
  • 16.6K users / month
  • 37.2K users / 6 months
  • 134 local subscribers
  • 74.9K subscribers
  • 16.7K Posts
  • 708K Comments
  • Modlog
  • mods:
  • L3s@lemmy.world
  • enu@lemmy.world
  • Technopagan@lemmy.world
  • L4sBot@lemmy.world
  • L3s@hackingne.ws
  • L4s@hackingne.ws
  • BE: 0.19.12
  • Modlog
  • Legal
  • Instances
  • Docs
  • Code
  • join-lemmy.org