Examples#
LP = Linear Programming, MIP = Mixed Integer Programming, DP = Dynamic Programming, RL = (Model-Free) Reinforcement Learning, MCTS = Monte-Carlo Tree Search
Problem |
Description |
Methods |
---|---|---|
Optimizes the transition function of a Turing machine to maximize the sum of the value on its tape |
Search |
|
Optimizes the action sequence of a cart to maximize the stabilization time of an inverted pendulum. |
RL |
|
Optimizes the route of a player to escape from a maze without hitting the wall |
DP, RL, Search |
|
Optimizes the ratio of beef and chichen to meet the nutritional standard with lowest cost |
LP |
|
Finding a feasible solution of 9x9 Soduku |
MIP |
|
Optimize the selection of investments to maximize the total contribution with limited availability of cash and manpower |
MIP, Metaheuristics, Search, DP |
|
Optimize the selection of items to maximize the total value without exceeding the capacity of the knapsack |
MIP, Metaheuristics, Search, DP, RL, RL+MCTS |
|
Optimize the route of a salesman that visits each city exactly once and returns to the origin city, so that the total length of the route is minimized |
MIP, Metaheuristics, Search, DP, RL |
|
Optimize the selection of an increasing sub-sequence from an array to maximize its length. |
DP, Search |
|
Optimize the bi-partition of an array to minimize the gap between the sum of the elements in both sets. |
DP |