Poker"Industry executives and analy

Poker

"Industry executives and analysts often mistakenly talk about strategy as if it were some kind of chess match. But in chess, you have just two opponents, each with identical resources, and with luck playing a minimal role. The real world is much more like a poker game, with multiple players trying to make the best of whatever hand fortune has dealt them." (David Moschella, Computerworld, 1999

Poker is the canonical example of a stochastic game with imperfect information. These are games with random chance (when cards are dealt from the deck) and some information is hidden from the players (their opponents' private cards). Poker is famous for its psychological aspects: bluffing with bad cards, feigning weakness with strong ones, and reading an opponent's facial expressions and "tells" to discover what cards they are hiding. In spite of these "human" elements, computers can still play poker at a high level of skill, including bluffing and other tricky plays, by approaching the game with game theory.
Poker is a family of wagering card games with different numbers of players, betting structures, and rules. The current most popular variant of poker, played in casinos and seen on television, is no-limit Texas hold'em. This game and a smaller variant, limit Texas hold'em, have been used as a testbed for artificial intelligence research since 1997. Since 2006, the Annual Computer Poker Competition has allowed researchers, programmers, and poker players to play their poker programs against each other, allowing us to find out which artificial intelligence techniques work best in practice. The competition has resulted in significant advances in fields such as computational game theory, and resulted in algorithms that can find optimal strategies for games six orders of magnitude larger than was possible using earlier techniques.
At the 2007 AAAI conference, a program called Polaris made by the University of Alberta's Computer Poker Research Group played heads-up limit Texas hold'em against two human professionals, Phil Laak and Ali Eslami, in the First Man-Machine Poker Championship. Over four duplicate poker matches, Polaris won once, tied once, and lost twice, losing the match overall by a small margin. In Las Vegas in 2008, an updated version of Polaris played against six human heads-up limit specialists in the Second Man-Machine Poker Championship. Over six matches, Polaris won three, lost two and tied one to win the event by a small margin. This marked the first time that a computer program defeated human poker professionals in a meaningful competition. Computer programs have continued to improve, and the top competitors in the Annual Computer Poker Competition are much stronger than the version of Polaris used in the 2008 competition.
Challenges for Artificial Intelligence Research:
Poker offers a rich set of research challenges that are not seen in popular AI research testbeds such as checkers, chess, and Go. These include:
• Imperfect information, where one or more players cannot determine the exact state of the game.
• Stochastic outcomes, where random chance affects the course of the game.
• Multiple players (two to ten). Games with more than one opponent are strategically very different from two-player games.
• Repeated interaction, where the players play a long series of games and can learn and adapt to their opponent over time.
• Variable payouts, so that players must adapt to exploit their opponents' mistakes in order to maximize their winnings.
Current Approaches:
In two-player poker games, the most successful approach to date is based on game theory. A Nash equilibrium is a set of strategies, one for each player, where no player can increase the amount they win by changing to a different strategy. In two-player zero-sum games, such as two-player poker games, one player using a Nash equilibrium strategy is guaranteed to not lose against any adversary, even a worst-case adversary that knows what the strategy is. If the adversary makes mistakes, then a Nash equilibrium strategy can win. If a Nash equilibrium strategy is computed before the game is played, it can be used against any adversary without needing to know anything about their strategy. This is a nice property in poker, because it means that we can use it to at worst tie the game, while learning what mistakes the opponent is making so that we can change our strategy in response. The Annual Computer Poker Competition has driven significant effort towards approximating Nash equilibrium strategies in very large games, and the best programs in two-player limit Texas hold'em are now very close to a Nash equilibrium strategy. This approach was used in the 2007 and 2008 Man-vs-Machine Poker Championships, and is also used by most of the top competitors in the Annual Computer Poker Competition.
The other popular approach in the Computer Poker Competition involves observing strong players, such as good humans or good computer programs, and learning a strategy that mimics their behavior. Recently, this has involved case-based reasoning, in which a program's current decision is compared to a set of examples stored in memory. The most similar examples are used to choose a response.
Open Problems:
While computer programs are now very strong in the simplest real poker game played in the competition, two-player limit Texas hold'em poker, there are many open problems and new challenges that need to be addressed:
• State-space abstraction. Human-scale poker games are far too large to exactly compute optimal strategies. Instead, a technique is used to simplify the game down to a size where we can compute good (but no longer perfect) strategies. This usually means finding decision points in the game that are strategically similar and merging them together. Automated techniques for doing this will become more and more important as we move from two-player limit Texas hold'em to larger games, like no-limit or multi-player games.
• Multi-player games. The most successful approach to date for making strong computer poker agents uses the mathematics of game theory to decide how to pick actions so that the strategy cannot lose much, or at all, against a perfect opponent. In games with more than one opponent, however, the mathematical guarantees of this approach are no longer useful. Instead of using a single precomputed strategy, a good computer poker agent will instead have to observe its opponents and choose a strategy that works well in response. This is largely an open problem.
• Adapting to opponents during a match. The goal of poker is to maximize the amount of money won from the opponents. However, the game theoretic approach to the game pursues a different goal, by trying to minimize how much money is lost to a perfect adversary. In order to win as much as possible from an opponent, a player has to observe the opponent's actions, find out what mistakes the opponent is making, and change their strategy in response to take advantage of those mistakes. Computer poker research has led to effective techniques that work when the opponent is observed for millions of games, but human players are able to adjust their strategy after only tens or hundreds of games.

Poker

"Industry executives and analysts often mistakenly talk about strategy as if it were some kind of chess match. But in chess, you have just two opponents, each with identical resources, and with luck playing a minimal role. The real world is much more like a poker game, with multiple players trying to make the best of whatever hand fortune has dealt them." (David Moschella, Computerworld, 1999

Poker is the canonical example of a stochastic game with imperfect information. These are games with random chance (when cards are dealt from the deck) and some information is hidden from the players (their opponents' private cards). Poker is famous for its psychological aspects: bluffing with bad cards, feigning weakness with strong ones, and reading an opponent's facial expressions and "tells" to discover what cards they are hiding. In spite of these "human" elements, computers can still play poker at a high level of skill, including bluffing and other tricky plays, by approaching the game with game theory.
Poker is a family of wagering card games with different numbers of players, betting structures, and rules. The current most popular variant of poker, played in casinos and seen on television, is no-limit Texas hold'em. This game and a smaller variant, limit Texas hold'em, have been used as a testbed for artificial intelligence research since 1997. Since 2006, the Annual Computer Poker Competition has allowed researchers, programmers, and poker players to play their poker programs against each other, allowing us to find out which artificial intelligence techniques work best in practice. The competition has resulted in significant advances in fields such as computational game theory, and resulted in algorithms that can find optimal strategies for games six orders of magnitude larger than was possible using earlier techniques.
At the 2007 AAAI conference, a program called Polaris made by the University of Alberta's Computer Poker Research Group played heads-up limit Texas hold'em against two human professionals, Phil Laak and Ali Eslami, in the First Man-Machine Poker Championship. Over four duplicate poker matches, Polaris won once, tied once, and lost twice, losing the match overall by a small margin. In Las Vegas in 2008, an updated version of Polaris played against six human heads-up limit specialists in the Second Man-Machine Poker Championship. Over six matches, Polaris won three, lost two and tied one to win the event by a small margin. This marked the first time that a computer program defeated human poker professionals in a meaningful competition. Computer programs have continued to improve, and the top competitors in the Annual Computer Poker Competition are much stronger than the version of Polaris used in the 2008 competition. 
Challenges for Artificial Intelligence Research:
Poker offers a rich set of research challenges that are not seen in popular AI research testbeds such as checkers, chess, and Go. These include:
• Imperfect information, where one or more players cannot determine the exact state of the game.
• Stochastic outcomes, where random chance affects the course of the game.
• Multiple players (two to ten). Games with more than one opponent are strategically very different from two-player games.
• Repeated interaction, where the players play a long series of games and can learn and adapt to their opponent over time.
• Variable payouts, so that players must adapt to exploit their opponents' mistakes in order to maximize their winnings.
Current Approaches: 
In two-player poker games, the most successful approach to date is based on game theory. A Nash equilibrium is a set of strategies, one for each player, where no player can increase the amount they win by changing to a different strategy. In two-player zero-sum games, such as two-player poker games, one player using a Nash equilibrium strategy is guaranteed to not lose against any adversary, even a worst-case adversary that knows what the strategy is. If the adversary makes mistakes, then a Nash equilibrium strategy can win. If a Nash equilibrium strategy is computed before the game is played, it can be used against any adversary without needing to know anything about their strategy. This is a nice property in poker, because it means that we can use it to at worst tie the game, while learning what mistakes the opponent is making so that we can change our strategy in response. The Annual Computer Poker Competition has driven significant effort towards approximating Nash equilibrium strategies in very large games, and the best programs in two-player limit Texas hold'em are now very close to a Nash equilibrium strategy. This approach was used in the 2007 and 2008 Man-vs-Machine Poker Championships, and is also used by most of the top competitors in the Annual Computer Poker Competition.
The other popular approach in the Computer Poker Competition involves observing strong players, such as good humans or good computer programs, and learning a strategy that mimics their behavior. Recently, this has involved case-based reasoning, in which a program's current decision is compared to a set of examples stored in memory. The most similar examples are used to choose a response.
Open Problems:
While computer programs are now very strong in the simplest real poker game played in the competition, two-player limit Texas hold'em poker, there are many open problems and new challenges that need to be addressed:
• State-space abstraction. Human-scale poker games are far too large to exactly compute optimal strategies. Instead, a technique is used to simplify the game down to a size where we can compute good (but no longer perfect) strategies. This usually means finding decision points in the game that are strategically similar and merging them together. Automated techniques for doing this will become more and more important as we move from two-player limit Texas hold'em to larger games, like no-limit or multi-player games.
• Multi-player games. The most successful approach to date for making strong computer poker agents uses the mathematics of game theory to decide how to pick actions so that the strategy cannot lose much, or at all, against a perfect opponent. In games with more than one opponent, however, the mathematical guarantees of this approach are no longer useful. Instead of using a single precomputed strategy, a good computer poker agent will instead have to observe its opponents and choose a strategy that works well in response. This is largely an open problem.
• Adapting to opponents during a match. The goal of poker is to maximize the amount of money won from the opponents. However, the game theoretic approach to the game pursues a different goal, by trying to minimize how much money is lost to a perfect adversary. In order to win as much as possible from an opponent, a player has to observe the opponent's actions, find out what mistakes the opponent is making, and change their strategy in response to take advantage of those mistakes. Computer poker research has led to effective techniques that work when the opponent is observed for millions of games, but human players are able to adjust their strategy after only tens or hundreds of games.

0/5000

จาก: -

เป็น: -

ผลลัพธ์ (ไทย) 1: [สำเนา]

คัดลอก!

โป๊กเกอร์"นักอุตสาหกรรมและนักวิเคราะห์มักจะแสดงพูดคุยเกี่ยวกับกลยุทธ์นั้นบางชนิดจับคู่หมากรุก แต่หมากรุก คุณมีเพียงสองฝ่ายตรงข้าม แต่ละทรัพยากรที่เหมือนกัน และโชคที่มีบทบาทเกี่ยวข้องน้อยที่สุด โลกจริงได้มากขึ้นเช่นเกมโป๊กเกอร์ มีผู้เล่นหลายคนที่พยายามทำให้ส่วนของฟอร์จูนมือใดมีแจก" (David Moschella, Computerworld, 1999โป๊กเกอร์เป็นตัวอย่างมาตรฐานของเกมแบบเฟ้นสุ่มข้อมูลไม่สมบูรณ์ นี่คือเกมที่ มีโอกาสสุ่ม (เมื่อมีแจกบัตรจากสำรับ) และบางข้อมูลถูกซ่อนจากผู้เล่น (ของฝ่ายตรงข้ามบัตรส่วนตัว) โป๊กเกอร์มีชื่อเสียงในด้านของจิตใจ: bluffing กับเสีย การ์ด feigning อ่อนแอกับคนแข็งแรง และอ่านนิพจน์ใบหน้าของฝ่ายตรงข้าม และ "บอก" ได้บัตรอะไรพวกเขาซ่อนตัวอยู่ แม้ว่าองค์ประกอบเหล่านี้ "มนุษย์" คอมพิวเตอร์สามารถยังคงเล่นในระดับสูงของทักษะ รวม bluffing และอื่น ๆ หากินเล่น โดยกำลังเกมกับทฤษฎีเกมโป๊กเกอร์เป็นครอบครัวของพนันขันต่อเกมบัตรกับตัวเลขที่แตกต่างกันของผู้เล่น โครงสร้าง การวางเดิมพัน และกฎ ตัวแปรปัจจุบันนิยมมากที่สุดของโป๊กเกอร์ เล่นในคาสิโน และเห็นในโทรทัศน์ จะไม่มีขีดจำกัดเท็กซัสโฮลด์ เกมนี้และย่อยเล็ก เท็กซัสโฮลด์จำกัด การใช้เป็น testbed การวิจัยปัญญาประดิษฐ์ตั้งแต่ปี 1997 2549 การแข่งขันโป๊กเกอร์คอมพิวเตอร์ประจำปีได้อนุญาตให้นักวิจัย โปรแกรมเมอร์ และผู้เล่นโป๊กเกอร์เล่นโป๊กเกอร์โปรแกรมของพวกเขากับแต่ละอื่น ๆ ทำให้เราสามารถค้นหาออกปัญญาประดิษฐ์ซึ่งเทคนิคการทำงานส่วนในทางปฏิบัติ การแข่งขันทำให้เกิดความก้าวหน้าสำคัญในการคำนวณทฤษฎีเกม และในอัลกอริทึมที่สามารถหากลยุทธ์ที่เหมาะสมที่สุดสำหรับเกมหกอันดับของขนาดใหญ่กว่าสามารถใช้เทคนิคก่อนหน้าในการประชุม AAAI 2007 โปรแกรมที่เรียกว่าโพลาริสโดย กลุ่มวิจัยมหาวิทยาลัยเบอร์ของคอมพิวเตอร์โป๊กเกอร์เล่น heads-up จำกัดเท็กซัสโฮลด์กับสองมนุษย์ผู้เชี่ยวชาญ Phil Laak และ Ali Eslami ในแชมป์โป๊กเกอร์เครื่องคนแรก 4 โป๊กเกอร์ซ้ำตรง โพลาริสชนะครั้ง ผูกครั้ง และแพ้สอง สูญเสียการแข่งขันโดยรวม โดยระยะขอบขนาดเล็ก ในลาสเวกัสในปี 2008 โพลาริสรุ่นปรับปรุงเล่นกับผู้เชี่ยวชาญจำกัดมนุษย์ heads-up หกในแชมป์โป๊กเกอร์เครื่องคนที่สอง ตรงกันกว่า 6 โพลาริสชนะ 3 แพ้สอง กผูกหนึ่งชนะเหตุการณ์ โดยระยะขอบเล็ก ทำเครื่องหมายเป็นครั้งแรกที่โปรแกรมคอมพิวเตอร์พ่ายแพ้มนุษย์โป๊กเกอร์มืออาชีพในการแข่งขันมีความหมาย โปรแกรมคอมพิวเตอร์ได้อย่างต่อเนื่องเพื่อปรับปรุง และคู่แข่งสูงสุดในการแข่งขันโป๊กเกอร์คอมพิวเตอร์ประจำปีจะแกร่งกว่ารุ่นของโพลาริสที่ใช้ในการแข่งขันปี 2008 ความท้าทายสำหรับการวิจัยปัญญาประดิษฐ์:โป๊กเกอร์มีชุดหลากหลายของความท้าทายงานวิจัยที่ไม่ได้เห็นใน testbeds วิจัย AI นิยมเช่นหมากฮอส หมากรุก และไป เหล่านี้รวมถึง:•ข้อมูลไม่สมบูรณ์ ที่ผู้เล่นอย่าง น้อยหนึ่งไม่สามารถระบุสถานะของเกมแน่นอน•แบบเฟ้นสุ่มผลลัพธ์ ที่สุ่มโอกาสส่งผลกระทบต่อหลักสูตรของเกม•ผู้เล่นหลายคน (2-10) เกมกับฝ่ายตรงข้ามมากกว่าหนึ่งมีกลยุทธ์แตกต่างมากจากสองผู้เล่นเกม•ซ้ำโต้ตอบ ที่ผู้เล่นเล่นเกมชุดยาว และสามารถเรียนรู้ และปรับให้เข้ากับฝ่ายตรงข้ามเวลา•ตัวแปร payouts เพื่อให้ผู้เล่นต้องปรับการใช้ข้อผิดพลาดของฝ่ายตรงข้ามในการใบสั่งเพื่อเพิ่มรายได้ของพวกเขาวิธีปัจจุบัน: ในเกมสองผู้เล่นโป๊กเกอร์ วิธีประสบความสำเร็จมากที่สุดถึงวันเป็นไปตามทฤษฎีเกม สมดุล Nash เป็นชุดของกลยุทธ์ หนึ่งสำหรับแต่ละเครื่องเล่น เล่นไม่สามารถเพิ่มยอดผู้ชนะ โดยการเปลี่ยนกลยุทธ์ที่แตกต่างกัน ผู้เล่นสองศูนย์–ผลรวม เกมเช่นเกมสองผู้เล่นโป๊กเกอร์ รับประกันหนึ่งผู้เล่นใช้กลยุทธ์สมดุล Nash จะไม่สูญเสียกับฝ่ายใด แม้แต่เชิง worst-case ที่รู้ว่ากลยุทธ์คืออะไร ถ้าโมโหจะทำให้ความผิดพลาด แล้วกลยุทธ์สมดุล Nash สามารถชนะ ถ้ากลยุทธ์สมดุล Nash คำนวณก่อนเล่นเกม สามารถใช้กับศัตรูใด ๆ ไม่จำเป็นต้องทราบอะไรเกี่ยวกับกลยุทธ์ของพวกเขา นี้เป็นอย่างดีในโป๊กเกอร์ เพราะมันหมายถึง ว่า เราสามารถใช้ผูกเกม ในขณะที่ข้อผิดพลาดใดฝ่ายตรงข้ามทำให้เราสามารถเปลี่ยนแปลงกลยุทธ์ของเราตอบสนองต่อการเรียนรู้ที่เลวร้ายที่สุด การแข่งขันโป๊กเกอร์คอมพิวเตอร์ประจำปีได้ขับเคลื่อนความพยายามอย่างมีนัยสำคัญต่อระหว่างกลยุทธ์สมดุล Nash ในเกมขนาดใหญ่ และโปรแกรมที่ดีที่สุดในวงเงินสองเล่นเท็กซัสโฮลด์อยู่ตอนนี้ใกล้กับกลยุทธ์สมดุล Nash วิธีการนี้ใช้ในการ 2007 และ 2008 คนกับเครื่องโป๊กเกอร์แข่งขัน และยังถูกใช้ โดยส่วนใหญ่คู่แข่งสูงสุดในการแข่งขันโป๊กเกอร์คอมพิวเตอร์ประจำปีอื่น ๆ วิธียอดนิยมในการแข่งขันโป๊กเกอร์คอมพิวเตอร์เกี่ยวข้องกับผู้เล่นที่แข็งแกร่ง เช่นมนุษย์ดีหรือโปรแกรมคอมพิวเตอร์ที่ดี การสังเกต และเรียนรู้กลยุทธ์ที่เลียนแบบลักษณะการทำงาน เมื่อเร็ว ๆ นี้ นี้ได้เกี่ยวข้องกับกรณีตามเหตุผล ที่ของโปรแกรมปัจจุบันตัดสินใจเปรียบเทียบกับชุดของตัวอย่างที่เก็บไว้ในหน่วยความจำ ตัวอย่างคล้ายกันมากที่สุดจะใช้เพื่อเลือกการตอบสนองปัญหาเปิด:ในขณะที่ตอนนี้แข็งแกร่งมากในเกมโป๊กเกอร์จริงง่ายที่สุดที่เล่นในการแข่งขัน ผู้เล่นสองวงเงินโป๊กเกอร์เท็กซัสโฮลด์ โปรแกรมคอมพิวเตอร์จะมีปัญหาเปิดและความท้าทายใหม่ที่จำเป็นต้องได้รับการ:•รัฐพื้นที่ abstraction เกมโป๊กเกอร์มาตราส่วนบุคคลมีขนาดใหญ่เกินไปจะคำนวณว่ากลยุทธ์ที่ดีที่สุด แทน เทคนิคการผลิตเพื่อทำเกมลงขนาดซึ่งเราสามารถคำนวณกลยุทธ์ดี (แต่ไม่สมบูรณ์) นี้มักจะหมายถึง การค้นหาจุดตัดสินใจในเกมที่มีกลยุทธ์เหมือนกัน และผสานเข้าด้วยกัน เทคนิคอัตโนมัติสำหรับการทำเช่นนี้จะกลายเป็นสิ่งสำคัญมากเราเข้าไปจากผู้เล่นสองจำกัดเท็กซัสโฮลด์เกมขนาดใหญ่ เช่นเกมผู้เล่นหลายคน หรือไม่มีวงเงิน•เกมผู้เล่นหลายคน วิธีประสบความสำเร็จมากที่สุดวันที่สำหรับการทำตัวแทนคอมพิวเตอร์แข็งแกร่งโป๊กเกอร์ใช้คณิตศาสตร์ ของทฤษฎีเกมในการตัดสินใจวิธีการเลือกการดำเนินการเพื่อให้กลยุทธ์ไม่สูญเสียมาก หรือทั้งหมด ฝ่ายตรงข้ามที่เหมาะสมกับ ในเกมกับฝ่ายตรงข้ามมากกว่าหนึ่ง อย่างไรก็ตาม ประกันคณิตศาสตร์ของวิธีการนี้จะไม่มีประโยชน์ แทนที่จะใช้กลยุทธ์ precomputed เดี่ยว ตัวแทนโป๊กเกอร์ดีคอมพิวเตอร์แทนได้สังเกตของฝ่ายตรงข้าม และเลือกกลยุทธ์ที่ดีในการตอบสนอง ส่วนใหญ่มีปัญหาที่เปิดอยู่•ปรับให้ฝ่ายตรงข้ามในระหว่างการแข่งขัน เป้าหมายของโป๊กเกอร์คือการ เพิ่มจำนวนเงินที่ชนะจากฝ่ายตรงข้าม อย่างไรก็ตาม วิธี theoretic เกมเกม pursues เป้าหมายแตกต่างกัน การพยายามลดจำนวนเงินหายโมโหโก การชนะ มากที่สุดจากคู่ต่อสู้ ผู้เล่นได้สังเกตการกระทำของฝ่ายตรงข้าม ค้นหาข้อผิดพลาดอะไรคือการทำให้ฝ่ายตรงข้าม และเปลี่ยนแปลงกลยุทธ์ของพวกเขาในการตอบสนองการใช้ประโยชน์จากข้อผิดพลาดเหล่านั้น วิจัยคอมพิวเตอร์โป๊กเกอร์ได้นำไปสู่เทคนิคที่มีประสิทธิภาพที่ทำงานได้เมื่อฝ่ายตรงข้ามจะพบนับล้านของเกม แต่ผู้เล่นที่มนุษย์จะสามารถปรับกลยุทธ์ของพวกเขาหลังจากสิบเท่าหรือหลายร้อยเกม

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 2:[สำเนา]

คัดลอก!

การแปล กรุณารอสักครู่..

ผลลัพธ์ (ไทย) 3:[สำเนา]

คัดลอก!

โป๊กเกอร์

" ผู้บริหารอุตสาหกรรมและนักวิเคราะห์มักจะผิดพลาดพูดคุยเกี่ยวกับกลยุทธ์เป็นถ้ามันเป็นหมากรุกที่ตรงกัน แต่ในหมากรุก , คุณมีเพียงสองคู่แข่งแต่ละทรัพยากรที่เหมือนกัน และมีโชคมีบทบาทน้อยที่สุด โลกจริงมากขึ้นเช่นเกมโป๊กเกอร์ผู้เล่นหลายพยายามที่จะทำให้ดีที่สุดในสิ่งที่มือ Fortune มีแจกพวกเขา . " ( เดวิด moschella Computerworld , ,1999

โป๊กเกอร์เป็นแบบตัวอย่างของ Stochastic เกมไม่สมบูรณ์ข้อมูล เหล่านี้เป็นเกมที่มีโอกาสสุ่มได้ ( เมื่อบัตรจะได้รับไพ่จากสำรับ ) และข้อมูลบางอย่างถูกซ่อนจากผู้เล่น ( คู่แข่งของพวกเขาส่วนบุคคลบัตร ) โป๊กเกอร์ที่มีชื่อเสียงในด้านจิตวิทยาของ : หลอกจริง การ์ด แสร้งทำเป็นจุดอ่อนที่แข็งแรงและการอ่านการแสดงออกทางสีหน้าของฝ่ายตรงข้ามและ " บอก " ค้นพบสิ่งที่บัตรพวกเขาจะซ่อนอยู่ ทั้งๆที่ " มนุษย์ " องค์ประกอบคอมพิวเตอร์ยังสามารถเล่นโป๊กเกอร์ในระดับของทักษะ รวมทั้งหลอกและเล่นหากินอื่น โดยเริ่มเกมกับทฤษฎีเกม .
โป๊กเกอร์เป็นครอบครัวของการพนันเกมไพ่ที่มีตัวเลขที่แตกต่างกันของผู้เล่นการพนันโครงสร้างและกฎตัวแปรที่ได้รับความนิยมมากที่สุดในปัจจุบันของโป๊กเกอร์เล่นในคาสิโนและที่เห็นในโทรทัศน์ มันไม่ จำกัด Texas Hold ' em เกมและตัวแปรที่มีขีด จำกัด เท็กซัสโฮลด์ ได้ถูกใช้เป็น Name = ทดสอบ Comment วิจัยปัญญาประดิษฐ์ตั้งแต่ปี 1997 ตั้งแต่ปี 2006 ปีการแข่งขันโป๊กเกอร์คอมพิวเตอร์ได้รับอนุญาตให้นักวิจัย โปรแกรมเมอร์ และผู้เล่นโป๊กเกอร์ที่จะเล่นโปรแกรมโป๊กเกอร์ของพวกเขากับแต่ละอื่น ๆให้เราดูที่เทคนิคปัญญาประดิษฐ์ทำงานที่ดีที่สุดในการปฏิบัติ การแข่งขันส่งผลให้เกิดความก้าวหน้าทางด้านสาขา เช่น ทฤษฎีเกมคอมพิวเตอร์ และมีผลในขั้นตอนวิธีที่สามารถหากลยุทธ์ที่เหมาะสมสำหรับเกมคำสั่งของขนาด ใหญ่ กว่า หกได้ใช้ก่อนหน้านี้เทคนิค .
ที่ 2007 aaai การประชุมโปรแกรมที่เรียกว่า Polaris โดยมหาวิทยาลัยกลุ่มการวิจัยอัลเบอร์ต้าเล่นโป๊กเกอร์หัวขึ้นขีด จำกัด เท็กซัสโฮลด์มนุษย์สองด้าน ฟิล และ อาลี eslami ทั้งหมดในคนแรกเครื่องโป๊กเกอร์แชมป์ กว่าสี่โป๊กเกอร์ซ้ำตรงกับดาวเหนือจะครั้งเดียว มัดหนึ่งครั้ง และแพ้ 2 ครั้ง เสียราคาโดยรวมขอบเล็ก ในลาสเวกัสในปี 2008รุ่นปรับปรุงของดาวเหนือเล่นกับหกหัวมนุษย์ขึ้น จำกัด ผู้เชี่ยวชาญใน ที่สองคนเครื่องโป๊กเกอร์แชมป์ กว่า 6 นัด ชนะ 3 แพ้ 2 แต่ , และผูกหนึ่งชนะเหตุการณ์โดยขอบเล็ก นับเป็นครั้งแรกที่โปรแกรมมืออาชีพโป๊กเกอร์มนุษย์พ่ายแพ้ในการแข่งขันที่มีความหมาย โปรแกรมคอมพิวเตอร์ได้อย่างต่อเนื่องเพื่อปรับปรุงและคู่แข่งด้านบนในการแข่งขันโป๊กเกอร์คอมพิวเตอร์ประจำปีจะแข็งแกร่งมากขึ้นกว่ารุ่นของดาวเหนือที่ใช้ในการแข่งขัน 2008
ความท้าทายสำหรับงานวิจัยทางปัญญาประดิษฐ์ :
โป๊กเกอร์เสนอตั้งมากมายของงานวิจัยที่ท้าทายที่ไม่เห็นในความนิยม AI การวิจัยรูปแบบเช่นหมากฮอส หมากรุก และไป เหล่านี้รวมถึง :
- ความไม่สมบูรณ์ของข้อมูลที่ผู้เล่นหนึ่งหรือมากกว่าหนึ่งไม่สามารถตรวจสอบสถานะที่แน่นอนของเกม .
- สุ่มแบบสุ่มโอกาสที่มีผลต่อหลักสูตรของเกมผู้เล่นหลาย .
- ( สองถึงสิบ ) เกมที่มีมากกว่าหนึ่งคู่แข่งมีมากแตกต่างจากสองผู้เล่นเกม .
- ทำซ้ำปฏิสัมพันธ์ที่ผู้เล่นเล่นชุดยาวของเกม และ สามารถเรียนรู้และปรับให้เข้ากับฝ่ายตรงข้ามของพวกเขาในช่วงเวลา .
- ตัวแปรการจ่ายเงินเพื่อให้ผู้เล่นต้องปรับตัวเพื่อใช้ประโยชน์จากฝ่ายตรงข้ามของพวกเขาผิดพลาดในการขยายการชนะของพวกเขา วิธีการที่ปัจจุบัน :

ในเกมที่สองผู้เล่นโป๊กเกอร์ , วิธีการประสบความสำเร็จมากที่สุดวันที่ยึดทฤษฎีเกม เป็นสมดุลของแนชคือชุดของกลยุทธ์ หนึ่งสำหรับผู้เล่นแต่ละคน ที่ไม่มีผู้เล่นสามารถเพิ่มจำนวนเงินที่พวกเขาชนะโดยการเปลี่ยนกลยุทธ์ที่แตกต่างกันสองผู้เล่นซีโร่ซัมเกม เช่น เกมที่สองผู้เล่นโป๊กเกอร์ผู้เล่นคนหนึ่งใช้สมดุลของแนชกลยุทธ์รับประกันไม่แพ้กับคู่ปรับ , คู่อริที่เลวร้ายที่สุดแม้รู้ว่าสิ่งที่กลยุทธ์คือ ถ้าคู่ต่อสู้ทำผิดแล้วสมดุลของแนชกลยุทธ์ชนะ หากสมดุลของแนชกลยุทธ์จะคำนวณก่อนเกมเล่นมันสามารถใช้กับใด ๆศัตรู โดยไม่ต้องรู้อะไรเกี่ยวกับกลยุทธ์ของพวกเขา นี้เป็นคุณสมบัติที่ดีในโป๊กเกอร์ เพราะมันหมายถึงว่าเราสามารถใช้มันเพื่อที่เลวร้ายที่สุดผูกเกม ในขณะที่การเรียนรู้ข้อผิดพลาดอะไรที่ฝ่ายตรงข้ามคือ ทำเพื่อให้เราสามารถเปลี่ยนกลยุทธ์ของเราในการตอบสนองปีการแข่งขันโป๊กเกอร์คอมพิวเตอร์ทำให้ความพยายามอย่างมากต่อประเภทกลยุทธ์สมดุลแนช ในเกมมีขนาดใหญ่มากและโปรแกรมที่ดีที่สุดในขีด จำกัด เท็กซัสโฮลด์ผู้เล่นสองคน ตอนนี้สนิทกับสมดุลของแนชกลยุทธ์ วิธีการนี้ถูกนำมาใช้ในปี 2007 และ 2008 ชาย VS เครื่องโป๊กเกอร์การแข่งขันและยังใช้โดยส่วนใหญ่ของคู่แข่งด้านบนในการแข่งขันโป๊กเกอร์คอมพิวเตอร์ประจำปี
วิธีที่นิยมอื่น ๆในการแข่งขันโป๊กเกอร์คอมพิวเตอร์เกี่ยวข้องกับการสังเกตผู้เล่นที่แข็งแกร่ง เช่น มนุษย์ ดี หรือ ดี คอมพิวเตอร์ โปรแกรม และการเรียนรู้กลยุทธ์ที่เลียนแบบพฤติกรรมของพวกเขา เมื่อเร็ว ๆนี้ได้มีส่วนร่วมกรณีศึกษาการใช้เหตุผลซึ่งในการตัดสินใจในปัจจุบันของโปรแกรมเมื่อเทียบกับชุดของตัวอย่างที่เก็บไว้ในหน่วยความจำ ตัวอย่างที่คล้ายกันส่วนใหญ่จะใช้ในการเลือกคำตอบ ปัญหา :

เปิดในขณะที่โปรแกรมคอมพิวเตอร์ขณะนี้ที่แข็งแกร่งมากในเกมโป๊กเกอร์จริงง่ายเล่นในการแข่งขัน สองขีด จำกัด เท็กซัสโฮลด์โป๊กเกอร์ผู้เล่นมีเปิดหลายปัญหา และความท้าทายใหม่ที่ต้อง addressed :
- สภาพพื้นที่นามธรรม เกมโป๊กเกอร์คนไกลขนาดใหญ่เกินไปจริงๆ หากลยุทธ์ที่เหมาะสมได้ แทน เป็นเทคนิคที่ใช้เพื่อลดความซับซ้อนของเกม เหลือไซส์ที่เราสามารถคำนวณดี ( แต่ไม่สมบูรณ์ ) กลยุทธ์ นี้มักจะหมายถึงการหาจุดการตัดสินใจในเกมที่คล้ายกันกลยุทธ์และรวมเข้าด้วยกันเทคนิคสำหรับการทำเช่นนี้โดยอัตโนมัติจะกลายเป็นมากขึ้นและที่สำคัญตามที่เราย้ายจากสองขีด จำกัด เท็กซัสโฮลด์เกมผู้เล่นขนาดใหญ่ เช่นไม่มีขีดจำกัดหรือหลายผู้เล่นเกม
- หลายผู้เล่นเกม ที่ประสบความสำเร็จมากที่สุดวิธีการอาจทำให้ตัวแทนโป๊กเกอร์คอมพิวเตอร์ที่แข็งแกร่งใช้คณิตศาสตร์ทฤษฎีเกมที่จะตัดสินใจเลือกวิธีการเลือกการกระทำเพื่อให้กลยุทธ์ที่ไม่สามารถสูญเสียมากหรือทั้งหมดกับฝ่ายตรงข้ามที่สมบูรณ์แบบ ในเกมที่มีมากกว่าหนึ่งของฝ่ายตรงข้าม อย่างไรก็ตาม การค้ำประกันทางคณิตศาสตร์ของวิธีการนี้จะไม่เป็นประโยชน์ แทนการใช้กลยุทธ์ precomputed เดียวตัวแทนโป๊กเกอร์ที่ดีคอมพิวเตอร์แทนจะต้องสังเกตของฝ่ายตรงข้ามและเลือกกลยุทธ์ที่ใช้ได้ดีในการตอบสนอง นี้เป็นส่วนใหญ่ปัญหาเปิด
- ปรับให้ฝ่ายตรงข้ามในระหว่างการแข่งขันเป้าหมายของโป๊กเกอร์คือการ เพิ่มจำนวนเงินที่ได้รับจากฝ่ายตรงข้าม อย่างไรก็ตาม เกมทฤษฎีแนวทางเกม pursues เป้าหมายแตกต่างกัน โดยพยายามที่จะลดเงินหายไปเท่าไหร่กับคู่ต่อสู้ที่สมบูรณ์แบบ เพื่อที่จะชนะมากที่สุดเท่าที่เป็นไปได้จากฝ่ายตรงข้าม ผู้เล่นได้สังเกตการกระทำของฝ่ายตรงข้าม , หาสิ่งที่ผิดฝ่ายตรงข้ามคือ การทําและเปลี่ยนกลยุทธ์ของพวกเขาในการตอบสนองการใช้ประโยชน์จากข้อผิดพลาดเหล่านั้น เพื่อโป๊กเกอร์คอมพิวเตอร์ได้นำเทคนิคงานนั้นมีประสิทธิภาพเมื่อฝ่ายตรงข้ามเป็นสังเกตสำหรับล้านของเกม แต่ผู้เล่นของมนุษย์จะสามารถปรับกลยุทธ์ของพวกเขาหลังจากเพียงสิบหรือหลายร้อยเกม

การแปล กรุณารอสักครู่..

ภาษาอื่น ๆ

การสนับสนุนเครื่องมือแปลภาษา: กรีก, กันนาดา, กาลิเชียน, คลิงออน, คอร์สิกา, คาซัค, คาตาลัน, คินยารวันดา, คีร์กิซ, คุชราต, จอร์เจีย, จีน, จีนดั้งเดิม, ชวา, ชิเชวา, ซามัว, ซีบัวโน, ซุนดา, ซูลู, ญี่ปุ่น, ดัตช์, ตรวจหาภาษา, ตุรกี, ทมิฬ, ทาจิก, ทาทาร์, นอร์เวย์, บอสเนีย, บัลแกเรีย, บาสก์, ปัญจาป, ฝรั่งเศส, พาชตู, ฟริเชียน, ฟินแลนด์, ฟิลิปปินส์, ภาษาอินโดนีเซี, มองโกเลีย, มัลทีส, มาซีโดเนีย, มาราฐี, มาลากาซี, มาลายาลัม, มาเลย์, ม้ง, ยิดดิช, ยูเครน, รัสเซีย, ละติน, ลักเซมเบิร์ก, ลัตเวีย, ลาว, ลิทัวเนีย, สวาฮิลี, สวีเดน, สิงหล, สินธี, สเปน, สโลวัก, สโลวีเนีย, อังกฤษ, อัมฮาริก, อาร์เซอร์ไบจัน, อาร์เมเนีย, อาหรับ, อิกโบ, อิตาลี, อุยกูร์, อุสเบกิสถาน, อูรดู, ฮังการี, ฮัวซา, ฮาวาย, ฮินดี, ฮีบรู, เกลิกสกอต, เกาหลี, เขมร, เคิร์ด, เช็ก, เซอร์เบียน, เซโซโท, เดนมาร์ก, เตลูกู, เติร์กเมน, เนปาล, เบงกอล, เบลารุส, เปอร์เซีย, เมารี, เมียนมา (พม่า), เยอรมัน, เวลส์, เวียดนาม, เอสเปอแรนโต, เอสโทเนีย, เฮติครีโอล, แอฟริกา, แอลเบเนีย, โคซา, โครเอเชีย, โชนา, โซมาลี, โปรตุเกส, โปแลนด์, โยรูบา, โรมาเนีย, โอเดีย (โอริยา), ไทย, ไอซ์แลนด์, ไอร์แลนด์, การแปลภาษา.