Reinforcement Learning beginner to master - AI in Python

By: Udemy

Overview

Build Artificial Intelligence (AI) agents using Deep Reinforcement Learning and PyTorch: A2C, REINFORCE, DQN, etc.

What you'll learn:
  • Understand the Reinforcement Learning paradigm and the tasks that it's best suited to solve.
  • Understand the process of solving a cognitive task using Reinforcement Learning.
  • Understand the different approaches to solving a task using Reinforcement Learning and choose the most fitting one.
  • Implement Reinforcement Learning algorithms completely from scratch
  • Fundamentally understand the learning process for each algorithm
  • Debug and extend the algorithms presented
  • Understand and implement new algorithms from research papers

This is the most complete Reinforcement Learning course on Udemy. In it you will learn the basics of Reinforcement Learning, one of the three main paradigms of machine learning. You will implement, from scratch, adaptive algorithms that solve control tasks based on experience. You will also learn to combine these algorithms with Deep Learning techniques and neural networks, giving rise to the branch known as Deep Reinforcement Learning.


This course will give you the foundation you need to be able to understand new algorithms as they emerge. It will also prepare you for the next courses in this series, in which we will go much deeper into different branches of Reinforcement Learning and look at some of the more advanced algorithms that exist.


The course is focused on developing practical skills. Therefore, after learning the most important concepts of each family of methods, we will implement one or more of their algorithms from scratch in Jupyter notebooks.


This course is divided into three parts and covers the following topics:


Part 1 (Tabular methods):


- Markov decision process


- Dynamic programming


- Monte Carlo methods


- Temporal difference methods (SARSA, Q-Learning)


- N-step bootstrapping
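
To make the difference between the two temporal difference methods listed above concrete, here is a minimal sketch of their one-step update rules on a table of action values. The function names, the list-of-lists table `q`, and the default `alpha` and `gamma` values are assumptions chosen for illustration and are not taken from the course material. Terminal states are not handled.

```python
def sarsa_update(q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.99):
    """On-policy TD update: bootstrap from the action actually taken next."""
    target = r + gamma * q[s_next][a_next]
    q[s][a] += alpha * (target - q[s][a])


def q_learning_update(q, s, a, r, s_next, alpha=0.1, gamma=0.99):
    """Off-policy TD update: bootstrap from the greedy action in the next state."""
    target = r + gamma * max(q[s_next])
    q[s][a] += alpha * (target - q[s][a])


# Toy action-value tables: 2 states x 2 actions (illustrative values only).
q_sarsa = [[0.0, 0.0], [1.0, 2.0]]
q_qlearn = [[0.0, 0.0], [1.0, 2.0]]

# Same transition for both methods; SARSA also needs the next action.
sarsa_update(q_sarsa, s=0, a=0, r=1.0, s_next=1, a_next=0)
q_learning_update(q_qlearn, s=0, a=0, r=1.0, s_next=1)
```

With the same transition, SARSA uses the value of the next action that was actually selected, while Q-Learning uses the largest value available in the next state, so the two tables end up with different estimates for the same state-action pair.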


Part 2 (Continuous state spaces):


- State aggregation


- Tile Coding
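
As a rough illustration of state aggregation, the sketch below maps a continuous state to a tuple of bin indices so that a tabular method can be applied to it. The function name `aggregate`, the bounds, and the bin counts are hypothetical and only serve the example; they are not taken from the course.

```python
def aggregate(state, lows, highs, bins):
    """Map a continuous state to a tuple of integer bin indices."""
    indices = []
    for x, lo, hi, n in zip(state, lows, highs, bins):
        x = min(max(x, lo), hi)              # clip to the assumed bounds
        i = int((x - lo) / (hi - lo) * n)    # scale to [0, n] and truncate
        indices.append(min(i, n - 1))        # the upper bound falls in the last bin
    return tuple(indices)


# Hypothetical 2-D state space split into a 4 x 4 grid.
lows, highs, bins = (0.0, -1.0), (1.0, 1.0), (4, 4)
cell_a = aggregate((0.3, 0.0), lows, highs, bins)
cell_b = aggregate((2.0, -5.0), lows, highs, bins)  # out-of-range values are clipped
```

Every state that falls into the same cell shares one table entry, which trades resolution for a finite table.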


Part 3 (Deep Reinforcement Learning):


- Deep SARSA


- Deep Q-Learning


- REINFORCE


- Advantage Actor-Critic (A2C)


Taught by

Escape Velocity Labs

  • Udemy
  • Paid
  • English
  • Certificate available
  • Available anytime
  • Beginner