Examples#
LP = Linear Programming, MIP = Mixed Integer Programming, DP = Dynamic Programming, RL = (Model-Free) Reinforcement Learning, MCTS = Monte-Carlo Tree Search
| Problem | Description | Methods | 
|---|---|---|
| Optimizes the transition function of a Turing machine to maximize the sum of the value on its tape | Search | |
| Optimizes the action sequence of a cart to maximize the stabilization time of an inverted pendulum.  | RL | |
| Optimizes the route of a player to escape from a maze without hitting the wall | DP, RL, Search | |
| Optimizes the ratio of beef and chichen to meet the nutritional standard with lowest cost | LP | |
| Finding a feasible solution of 9x9 Soduku | MIP | |
| Optimize the selection of investments to maximize the total contribution with limited availability of cash and manpower | MIP, Metaheuristics, Search, DP | |
| Optimize the selection of items to maximize the total value without exceeding the capacity of the knapsack | MIP, Metaheuristics, Search, DP, RL, RL+MCTS | |
| Optimize the route of a salesman that visits each city exactly once and returns to the origin city, so that the total length of the route is minimized | MIP, Metaheuristics, Search, DP, RL | |
| Optimize the selection of an increasing sub-sequence from an array to maximize its length. | DP, Search | |
| Optimize the bi-partition of an array to minimize the gap between the sum of the elements in both sets. | DP | 







