Reinforcement Learning beginner to master - AI in Python

Brought by: Udemy

Overview

Build Artificial Intelligence (AI) agents using Deep Reinforcement Learning and PyTorch: A2C, REINFORCE, DQN, etc.

What you'll learn:
  • Understand the Reinforcement Learning paradigm and the tasks that it's best suited to solve.
  • Understand the process of solving a cognitive task using Reinforcement Learning
  • Understand the different approaches to solving a task using Reinforcement Learning and choose the most fitting
  • Implement Reinforcement Learning algorithms completely from scratch
  • Fundamentally understand the learning process for each algorithm
  • Debug and extend the algorithms presented
  • Understand and implement new algorithms from research papers

This is the most complete Reinforcement Learning course on Udemy. In it you will learn the basics of Reinforcement Learning, one of the three paradigms of modern artificial intelligence. You will implement from scratch adaptive algorithms that solve control tasks based on experience. You will also learn to combine these algorithms with Deep Learning techniques and neural networks, giving rise to the branch known as Deep Reinforcement Learning.


This course will give you the foundation you need to be able to understand new algorithms as they emerge. It will also prepare you for the next courses in this series, in which we will go much deeper into different branches of Reinforcement Learning and look at some of the more advanced algorithms that exist.


The course is focused on developing practical skills. Therefore, after learning the most important concepts of each family of methods, we will implement one or more of their algorithms in jupyter notebooks, from scratch.


This course is divided into three parts and covers the following topics:


Part 1 (Tabular methods):


- Markov decision process


- Dynamic programming


- Monte Carlo methods


- Time difference methods (SARSA, Q-Learning)


- N-step bootstrapping


Part 2 (Continuous state spaces):


- State aggregation


- Tile Coding


Part 3 (Deep Reinforcement Learning):


- Deep SARSA


- Deep Q-Learning


- REINFORCE


- Advantage Actor-Critic / A2C (Advantage Actor-Critic / A2C method)


Taught by

Escape Velocity Labs

Reinforcement Learning beginner to master - AI in Python
Go to course

Reinforcement Learning beginner to master - AI in Python

Brought by: Udemy

  • Udemy
  • Paid
  • English
  • Certificate Available
  • Available at any time
  • beginner
  • English
8.1.2PHP Version1.11sRequest Duration2MBMemory UsageGET en/courses/{slug}Route
    • Booting (626ms)
    • Application (478ms)
    • 1 x Booting (56.61%)
      626.15ms
      1 x Application (43.17%)
      477.50ms
      14 templates were rendered
      • public.courses.show (resources/views/public/courses/show.blade.php)3bladefile
        Params
        0
        course
        1
        links
        2
        config
      • public.courses.partials.breadcrumbs (resources/views/public/courses/partials/breadcrumbs.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.courses.partials.heading (resources/views/public/courses/partials/heading.blade.php)7bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
        6
        classes
      • public.courses.partials.details (resources/views/public/courses/partials/details.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.courses.partials.breadcrumbs (resources/views/public/courses/partials/breadcrumbs.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.courses.partials.heading (resources/views/public/courses/partials/heading.blade.php)7bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
        6
        classes
      • public.layouts.main (resources/views/public/layouts/main.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.layouts.partials.meta (resources/views/public/layouts/partials/meta.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.layouts.partials.navbar (resources/views/public/layouts/partials/navbar.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.auth.profile.partials.links (resources/views/public/auth/profile/partials/links.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.auth.profile.partials.link (resources/views/public/auth/profile/partials/link.blade.php)8bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
        6
        route
        7
        title
      • public.auth.profile.partials.link (resources/views/public/auth/profile/partials/link.blade.php)8bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
        6
        route
        7
        title
      • public.auth.profile.partials.link (resources/views/public/auth/profile/partials/link.blade.php)8bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
        6
        route
        7
        title
      • public.layouts.partials.flash-session (resources/views/public/layouts/partials/flash-session.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      uri
      GET en/courses/{slug}
      middleware
      web, localize:en
      controller
      App\Http\Controllers\CourseController@show
      as
      en.courses.show
      namespace
      prefix
      /en
      where
      file
      app/Http/Controllers/CourseController.php:17-35
      6 statements were executed152ms
      • select * from `courses` where `slug_en` = 'reinforcement-learning-beginner-to-master---ai-in-python' limit 1
        15.48ms/app/Http/Controllers/CourseController.php:20corspedia
        Metadata
        Bindings
        • 0. reinforcement-learning-beginner-to-master---ai-in-python
        Backtrace
        • 17. /app/Http/Controllers/CourseController.php:20
        • 18. /vendor/laravel/framework/src/Illuminate/Routing/Controller.php:54
        • 19. /vendor/laravel/framework/src/Illuminate/Routing/ControllerDispatcher.php:43
        • 20. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:260
        • 21. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:205
      • update `courses` set `visitors` = `visitors` + 1, `courses`.`updated_at` = '2025-06-13 07:18:07' where `id` = 4248
        135ms/app/Http/Controllers/CourseController.php:21corspedia
        Metadata
        Bindings
        • 0. 2025-06-13 07:18:07
        • 1. 4248
        Backtrace
        • 17. /app/Http/Controllers/CourseController.php:21
        • 18. /vendor/laravel/framework/src/Illuminate/Routing/Controller.php:54
        • 19. /vendor/laravel/framework/src/Illuminate/Routing/ControllerDispatcher.php:43
        • 20. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:260
        • 21. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:205
      • select `id`, `name_en`, `name_ar`, `topic_id`, `slug_en`, `slug_ar` from `subjects` where `subjects`.`id` in (62)
        250μs/app/Http/Controllers/CourseController.php:23corspedia
        Metadata
        Backtrace
        • 20. /app/Http/Controllers/CourseController.php:23
        • 21. /vendor/laravel/framework/src/Illuminate/Routing/Controller.php:54
        • 22. /vendor/laravel/framework/src/Illuminate/Routing/ControllerDispatcher.php:43
        • 23. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:260
        • 24. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:205
      • select `id`, `name_en`, `name_ar`, `slug_en`, `slug_ar` from `topics` where `topics`.`id` in (1)
        220μs/app/Http/Controllers/CourseController.php:23corspedia
        Metadata
        Backtrace
        • 25. /app/Http/Controllers/CourseController.php:23
        • 26. /vendor/laravel/framework/src/Illuminate/Routing/Controller.php:54
        • 27. /vendor/laravel/framework/src/Illuminate/Routing/ControllerDispatcher.php:43
        • 28. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:260
        • 29. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:205
      • select * from `providers` where `providers`.`id` in (51) and `providers`.`deleted_at` is null
        250μs/app/Http/Controllers/CourseController.php:23corspedia
        Metadata
        Backtrace
        • 20. /app/Http/Controllers/CourseController.php:23
        • 21. /vendor/laravel/framework/src/Illuminate/Routing/Controller.php:54
        • 22. /vendor/laravel/framework/src/Illuminate/Routing/ControllerDispatcher.php:43
        • 23. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:260
        • 24. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:205
      • select * from `html_files` where `html_files`.`id` = 4239 limit 1
        270μs/app/Models/Course.php:84corspedia
        Metadata
        Bindings
        • 0. 4239
        Backtrace
        • 21. /app/Models/Course.php:84
        • 28. view::public.courses.show:29
        • 30. /vendor/laravel/framework/src/Illuminate/Filesystem/Filesystem.php:125
        • 31. /vendor/laravel/framework/src/Illuminate/View/Engines/PhpEngine.php:58
        • 32. /vendor/laravel/framework/src/Illuminate/View/Engines/CompilerEngine.php:72
      App\Models\HtmlFile
      1
      App\Models\Provider
      1
      App\Models\Topic
      1
      App\Models\Subject
      1
      App\Models\Course
      1
        _token
        BUw3T0RFz0kpMFHkXp3v26p5w1vFkomNvi31pYCu
        locale
        en
        _previous
        array:1 [ "url" => "https://www.corspedia.com/en/courses/reinforcement-learning-beginner-to-master...
        _flash
        array:2 [ "old" => [] "new" => [] ]
        PHPDEBUGBAR_STACK_DATA
        []
        path_info
        /en/courses/reinforcement-learning-beginner-to-master---ai-in-python
        status_code
        200
        
        status_text
        OK
        format
        html
        content_type
        text/html; charset=UTF-8
        request_query
        []
        
        request_request
        []
        
        request_headers
        0 of 0
        array:24 [ "cf-ipcountry" => array:1 [ 0 => "US" ] "cf-connecting-ip" => array:1 [ 0 => "216.73.216.14" ] "cdn-loop" => array:1 [ 0 => "cloudflare; loops=1" ] "x-forwarded-proto" => array:1 [ 0 => "https" ] "x-forwarded-for" => array:1 [ 0 => "216.73.216.14" ] "sec-fetch-site" => array:1 [ 0 => "none" ] "accept" => array:1 [ 0 => "text/html,application/xhtml+xml,application/xml;q=0.9,image/avif,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3;q=0.7" ] "user-agent" => array:1 [ 0 => "Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com)" ] "upgrade-insecure-requests" => array:1 [ 0 => "1" ] "sec-ch-ua-platform" => array:1 [ 0 => ""Windows"" ] "sec-ch-ua-mobile" => array:1 [ 0 => "?0" ] "sec-ch-ua" => array:1 [ 0 => ""Chromium";v="130", "HeadlessChrome";v="130", "Not?A_Brand";v="99"" ] "cache-control" => array:1 [ 0 => "no-cache" ] "pragma" => array:1 [ 0 => "no-cache" ] "sec-fetch-dest" => array:1 [ 0 => "document" ] "cf-ray" => array:1 [ 0 => "94efcfe0985910ab-ORD" ] "accept-encoding" => array:1 [ 0 => "gzip, br" ] "priority" => array:1 [ 0 => "u=0, i" ] "sec-fetch-user" => array:1 [ 0 => "?1" ] "sec-fetch-mode" => array:1 [ 0 => "navigate" ] "cf-visitor" => array:1 [ 0 => "{"scheme":"https"}" ] "host" => array:1 [ 0 => "www.corspedia.com" ] "content-length" => array:1 [ 0 => "" ] "content-type" => array:1 [ 0 => "" ] ]
        request_server
        0 of 0
        array:50 [ "USER" => "www-data" "HOME" => "/var/www" "HTTP_CF_IPCOUNTRY" => "US" "HTTP_CF_CONNECTING_IP" => "216.73.216.14" "HTTP_CDN_LOOP" => "cloudflare; loops=1" "HTTP_X_FORWARDED_PROTO" => "https" "HTTP_X_FORWARDED_FOR" => "216.73.216.14" "HTTP_SEC_FETCH_SITE" => "none" "HTTP_ACCEPT" => "text/html,application/xhtml+xml,application/xml;q=0.9,image/avif,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3;q=0.7" "HTTP_USER_AGENT" => "Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com)" "HTTP_UPGRADE_INSECURE_REQUESTS" => "1" "HTTP_SEC_CH_UA_PLATFORM" => ""Windows"" "HTTP_SEC_CH_UA_MOBILE" => "?0" "HTTP_SEC_CH_UA" => ""Chromium";v="130", "HeadlessChrome";v="130", "Not?A_Brand";v="99"" "HTTP_CACHE_CONTROL" => "no-cache" "HTTP_PRAGMA" => "no-cache" "HTTP_SEC_FETCH_DEST" => "document" "HTTP_CF_RAY" => "94efcfe0985910ab-ORD" "HTTP_ACCEPT_ENCODING" => "gzip, br" "HTTP_PRIORITY" => "u=0, i" "HTTP_SEC_FETCH_USER" => "?1" "HTTP_SEC_FETCH_MODE" => "navigate" "HTTP_CF_VISITOR" => "{"scheme":"https"}" "HTTP_HOST" => "www.corspedia.com" "REDIRECT_STATUS" => "200" "SERVER_NAME" => "corspedia.com" "SERVER_PORT" => "443" "SERVER_ADDR" => "141.95.147.152" "REMOTE_USER" => "" "REMOTE_PORT" => "21118" "REMOTE_ADDR" => "172.71.1.137" "SERVER_SOFTWARE" => "nginx/1.18.0" "GATEWAY_INTERFACE" => "CGI/1.1" "HTTPS" => "on" "REQUEST_SCHEME" => "https" "SERVER_PROTOCOL" => "HTTP/2.0" "DOCUMENT_ROOT" => "/var/www/corspedia/public" "DOCUMENT_URI" => "/index.php" "REQUEST_URI" => "/en/courses/reinforcement-learning-beginner-to-master---ai-in-python" "SCRIPT_NAME" => "/index.php" "CONTENT_LENGTH" => "" "CONTENT_TYPE" => "" "REQUEST_METHOD" => "GET" "QUERY_STRING" => "" "SCRIPT_FILENAME" => "/var/www/corspedia/public/index.php" "PATH_INFO" => "" "FCGI_ROLE" => "RESPONDER" "PHP_SELF" => "/index.php" "REQUEST_TIME_FLOAT" => 1749799086.4572 "REQUEST_TIME" => 1749799086 ]
        request_cookies
        []
        
        response_headers
        0 of 0
        array:5 [ "content-type" => array:1 [ 0 => "text/html; charset=UTF-8" ] "cache-control" => array:1 [ 0 => "no-cache, private" ] "date" => array:1 [ 0 => "Fri, 13 Jun 2025 07:18:07 GMT" ] "set-cookie" => array:2 [ 0 => "XSRF-TOKEN=eyJpdiI6IlV4b1pDT3BCWWhWcDFPY2Q5bUFyQ0E9PSIsInZhbHVlIjoiVGdwK3AwUnVZb0pXYm5HK0xqNDN2cjBZd1FQZi9GblIvbytUc1N6dnZJaHBaSlFTZzNLTXl1RDFVa3FTZTlaQmQ5T0tJUWtETXI3VnplOGJpWEhOOVpxVFAwS0Zuc2xKWlRzb0NLb3gzbWVZd3FQdkNGU2Y0enVwalJ2aGpmb0IiLCJtYWMiOiJmZTAzYTNlMGMxY2ZmMTA5NTQyYjIyZDRmZGY3NzViMjJkNTQxOGM1M2MxMWMzYWQ1ZDRhMzkzNjJhMzJkOTQ0IiwidGFnIjoiIn0%3D; expires=Fri, 13 Jun 2025 09:18:07 GMT; Max-Age=7200; path=/; samesite=laxXSRF-TOKEN=eyJpdiI6IlV4b1pDT3BCWWhWcDFPY2Q5bUFyQ0E9PSIsInZhbHVlIjoiVGdwK3AwUnVZb0pXYm5HK0xqNDN2cjBZd1FQZi9GblIvbytUc1N6dnZJaHBaSlFTZzNLTXl1RDFVa3FTZTlaQmQ5T0tJU" 1 => "laravel_session=eyJpdiI6IkUyOE9XbTA1S0hkNGNxRndmQ2JmZFE9PSIsInZhbHVlIjoiZ2N4aUNFSWd0TnRoSXM4VHc4UTIxQ1RDRE5zbGRZYkgxMnZZNFd1Smxza1c5VzNaa1Y5WDdKU3hnUjZobEhWUG9JOVdjcU5VS0s5R2dWaXNmemZXTmcrVjVtVlJzUFM3NlZNMFA2RzJRODJWSzVaZGloZGJzMitLZTNKZklXNXciLCJtYWMiOiIyN2JkMmZmMDQyMzgwNWNlNzA1MWVlYjhlM2IzN2UzNTJhMTQyOWY1MGE1ZDBjOTFlMWY3ZGNmZDZiYzEzYTVjIiwidGFnIjoiIn0%3D; expires=Fri, 13 Jun 2025 09:18:07 GMT; Max-Age=7200; path=/; httponly; samesite=laxlaravel_session=eyJpdiI6IkUyOE9XbTA1S0hkNGNxRndmQ2JmZFE9PSIsInZhbHVlIjoiZ2N4aUNFSWd0TnRoSXM4VHc4UTIxQ1RDRE5zbGRZYkgxMnZZNFd1Smxza1c5VzNaa1Y5WDdKU3hnUjZobEhWUG9J" ] "Set-Cookie" => array:2 [ 0 => "XSRF-TOKEN=eyJpdiI6IlV4b1pDT3BCWWhWcDFPY2Q5bUFyQ0E9PSIsInZhbHVlIjoiVGdwK3AwUnVZb0pXYm5HK0xqNDN2cjBZd1FQZi9GblIvbytUc1N6dnZJaHBaSlFTZzNLTXl1RDFVa3FTZTlaQmQ5T0tJUWtETXI3VnplOGJpWEhOOVpxVFAwS0Zuc2xKWlRzb0NLb3gzbWVZd3FQdkNGU2Y0enVwalJ2aGpmb0IiLCJtYWMiOiJmZTAzYTNlMGMxY2ZmMTA5NTQyYjIyZDRmZGY3NzViMjJkNTQxOGM1M2MxMWMzYWQ1ZDRhMzkzNjJhMzJkOTQ0IiwidGFnIjoiIn0%3D; expires=Fri, 13-Jun-2025 09:18:07 GMT; path=/XSRF-TOKEN=eyJpdiI6IlV4b1pDT3BCWWhWcDFPY2Q5bUFyQ0E9PSIsInZhbHVlIjoiVGdwK3AwUnVZb0pXYm5HK0xqNDN2cjBZd1FQZi9GblIvbytUc1N6dnZJaHBaSlFTZzNLTXl1RDFVa3FTZTlaQmQ5T0tJU" 1 => "laravel_session=eyJpdiI6IkUyOE9XbTA1S0hkNGNxRndmQ2JmZFE9PSIsInZhbHVlIjoiZ2N4aUNFSWd0TnRoSXM4VHc4UTIxQ1RDRE5zbGRZYkgxMnZZNFd1Smxza1c5VzNaa1Y5WDdKU3hnUjZobEhWUG9JOVdjcU5VS0s5R2dWaXNmemZXTmcrVjVtVlJzUFM3NlZNMFA2RzJRODJWSzVaZGloZGJzMitLZTNKZklXNXciLCJtYWMiOiIyN2JkMmZmMDQyMzgwNWNlNzA1MWVlYjhlM2IzN2UzNTJhMTQyOWY1MGE1ZDBjOTFlMWY3ZGNmZDZiYzEzYTVjIiwidGFnIjoiIn0%3D; expires=Fri, 13-Jun-2025 09:18:07 GMT; path=/; httponlylaravel_session=eyJpdiI6IkUyOE9XbTA1S0hkNGNxRndmQ2JmZFE9PSIsInZhbHVlIjoiZ2N4aUNFSWd0TnRoSXM4VHc4UTIxQ1RDRE5zbGRZYkgxMnZZNFd1Smxza1c5VzNaa1Y5WDdKU3hnUjZobEhWUG9J" ] ]
        session_attributes
        0 of 0
        array:5 [ "_token" => "BUw3T0RFz0kpMFHkXp3v26p5w1vFkomNvi31pYCu" "locale" => "en" "_previous" => array:1 [ "url" => "https://www.corspedia.com/en/courses/reinforcement-learning-beginner-to-master---ai-in-python" ] "_flash" => array:2 [ "old" => [] "new" => [] ] "PHPDEBUGBAR_STACK_DATA" => [] ]