Reinforcement Learning

Brought by: Brilliant

Overview

This course was written by Tessa van der Heiden, a researcher and developer of autonomous driving algorithms at BMW.

In this course, you'll learn the mathematical underpinnings of reinforcement learning, a foundational machine learning technique in which an agent (or algorithm) is trained by trial and error. When the agent is rewarded for good outcomes, it "learns" optimal strategies, which can be applied to problems in domains like robotics, quantitative trading, and game theory.
This course is intended for young professionals who are interested in applying machine learning techniques for decision making, or students who are pursuing a machine learning career or preparing for interviews.

Syllabus

  • Introduction:
    • Introduction: How does a computer devise a strategy to play a game optimally?
  • Foundations:
    • Value Functions: When transitioning between various options, the algorithm must quantify how good these options are.
    • Dynamic Programming: Optimize an interconnected system by reducing it into smaller systems.
    • Monte Carlo: If we make random moves a large number of times, we might notice a pattern that allows us to solve the problem deterministically.
  • Extensions:
    • Temporal Difference Learning: Explore a method of reinforcement learning that updates every time step — not just at the end of the episode.
    • Policy Gradient Methods: These methods take a different approach: they learn the optimal policy directly.
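
The contrast the syllabus draws between Monte Carlo and Temporal Difference Learning (updating at the end of the episode versus at every time step) can be sketched in a few lines of Python. This is only an illustrative sketch, not material from the course: the five-state random-walk environment, the function names, and the step size `alpha` are all assumptions chosen for the example.

```python
import random

N_STATES = 5  # hypothetical chain of states 0..4; exiting on the right pays 1, on the left pays 0


def step(state, rng):
    """Move left or right at random. Returns (reward, next_state); next_state is None when the episode ends."""
    nxt = state + rng.choice((-1, 1))
    if nxt < 0:
        return 0.0, None
    if nxt >= N_STATES:
        return 1.0, None
    return 0.0, nxt


def td0_episode(values, rng, alpha=0.05, gamma=1.0, start=2):
    """Temporal difference (TD(0)): adjust the estimate after every single step."""
    state = start
    while state is not None:
        reward, nxt = step(state, rng)
        bootstrap = values[nxt] if nxt is not None else 0.0
        values[state] += alpha * (reward + gamma * bootstrap - values[state])
        state = nxt


def mc_episode(values, rng, alpha=0.05, gamma=1.0, start=2):
    """Monte Carlo: record the whole episode first, then update from the observed returns."""
    state, trajectory = start, []
    while state is not None:
        reward, nxt = step(state, rng)
        trajectory.append((state, reward))
        state = nxt
    g = 0.0
    for s, r in reversed(trajectory):
        g = r + gamma * g  # return observed from state s to the end of the episode
        values[s] += alpha * (g - values[s])


rng = random.Random(0)
td_values = [0.5] * N_STATES
mc_values = [0.5] * N_STATES
for _ in range(5000):
    td0_episode(td_values, rng)
    mc_episode(mc_values, rng)
```

For this particular walk the true value of state i is (i + 1) / 6, so both lists of estimates should drift toward roughly 0.17, 0.33, 0.5, 0.67, 0.83, with some noise because the step size is held constant.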

  • Brilliant
  • Paid
  • English
  • Certificate Not Available
  • Certain days
  • beginner
  • N/A