Introduction to Reinforcement Learning

Brought by: Coursera

Overview

The goal of «Intro to Reinforcement Learning» is in its name: to introduce students to reinforcement learning, a prominent area of modern research in artificial intelligence. Reinforcement learning differs substantially from both supervised and unsupervised learning and is closer to how humans learn in reality.
In this course, students will learn both the theoretical core of reinforcement learning (RL) and recent practical RL methods. Most importantly, they will learn how to apply these methods to practical problems. Over six weeks, students will be guided through the basics of RL: the essential theory, value-based methods (such as SARSA and Q-learning), policy-based algorithms, and methods designed to solve the optimal exploration problem. In addition to algorithms and theory, the course presents practical tips and tricks for stabilizing learning and shows how to apply the methods to large-scale problems with deep neural networks.

Taught by

Pavel Shvechikov and Alexander Panin

  • Coursera
  • Free
  • English
  • Certificate Available
  • Certain days
  • All
  • N/A