Information Extraction from Free Text Data in Health

Brought by: Coursera

Overview

In this MOOC, you will be introduced to advanced machine learning and natural language
processing techniques to parse and extract information from unstructured text documents in
healthcare, such as clinical notes, radiology reports, and discharge summaries. Whether you are an aspiring data scientist or an early or mid-career professional in data science or information technology in healthcare, it is critical that you keep up-to-date your skills in information extraction and analysis.

To be successful in this course, you should build on the concepts learned through other intermediate-level MOOC courses and specializations in Data Science offered by the University of Michigan, so you will be able to delve deeper into challenges in recognizing medical entities in health-related documents, extracting clinical information, addressing ambiguity and polysemy to tag them with correct concept types, and develop tools and techniques to analyze new genres of health information.

By the end of this course, you will be able to:
Identify text mining approaches needed to identify and extract different kinds of information from health-related text data
Create an end-to-end NLP pipeline to extract medical concepts from clinical free text using one terminology resource
Differentiate how training deep learning models differ from training traditional machine learning models
Configure a deep neural network model to detect adverse events from drug reviews
List the pros and cons of Deep Learning approaches."

Syllabus

  • Week 1 | What is Information Extraction?
    • Welcome to Week 1! We start this week by getting familiar with the process of information extraction. We will see specific techniques, such as regular expressions to extract information. We will also cover several evaluation approaches for information extraction. Let's get started!
  • Week 2 | Named Entity Recognition (NER)
    • Welcome to Week 2! We continue exploring information extraction methods and processes this week. We will learn about terminology resources available for medical concepts, and using these resources, develop an end-to-end pipeline to extract text fields from health text. Let's get started!
  • Week 3 | Sequential Classification
    • Welcome to Week 3! This week, we will learn how to formulate medical information extraction as a sequential classification task. In doing so, we will learn how to use an annotated clinical text dataset, to train a machine learning model. Let's get started!
  • Week 4 | Introduction to Advanced Approaches to NER in Health
    • Welcome to Week 4! We end our course by exploring advanced methods in information extraction using AI tools. Specifically, we will learn about neural network model to identify medical concepts from clinical text, and how to apply a trained machine learning model for a medical information extraction task. Let's get started!

Taught by

V. G. Vinod Vydiswaran

Information Extraction from Free Text Data in Health
Go to course

Information Extraction from Free Text Data in Health

Brought by: Coursera

  • Coursera
  • Free
  • English
  • Certificate Available
  • Available at any time
  • intermediate
  • English, Kazakh, German, Hindi, Russian, Swedish, Korean, Portuguese, Greek, Italian, French, Chinese, Spanish, Arabic, Thai, Ukrainian, Japanese, Indonesian, Polish, Dutch, Turkish
8.1.2PHP Version918msRequest Duration2MBMemory UsageGET en/courses/{slug}Route
    • Booting (598ms)
    • Application (318ms)
    • 1 x Booting (65.13%)
      597.85ms
      1 x Application (34.63%)
      317.85ms
      14 templates were rendered
      • public.courses.show (resources/views/public/courses/show.blade.php)3bladefile
        Params
        0
        course
        1
        links
        2
        config
      • public.courses.partials.breadcrumbs (resources/views/public/courses/partials/breadcrumbs.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.courses.partials.heading (resources/views/public/courses/partials/heading.blade.php)7bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
        6
        classes
      • public.courses.partials.details (resources/views/public/courses/partials/details.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.courses.partials.breadcrumbs (resources/views/public/courses/partials/breadcrumbs.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.courses.partials.heading (resources/views/public/courses/partials/heading.blade.php)7bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
        6
        classes
      • public.layouts.main (resources/views/public/layouts/main.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.layouts.partials.meta (resources/views/public/layouts/partials/meta.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.layouts.partials.navbar (resources/views/public/layouts/partials/navbar.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.auth.profile.partials.links (resources/views/public/auth/profile/partials/links.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.auth.profile.partials.link (resources/views/public/auth/profile/partials/link.blade.php)8bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
        6
        route
        7
        title
      • public.auth.profile.partials.link (resources/views/public/auth/profile/partials/link.blade.php)8bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
        6
        route
        7
        title
      • public.auth.profile.partials.link (resources/views/public/auth/profile/partials/link.blade.php)8bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
        6
        route
        7
        title
      • public.layouts.partials.flash-session (resources/views/public/layouts/partials/flash-session.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      uri
      GET en/courses/{slug}
      middleware
      web, localize:en
      controller
      App\Http\Controllers\CourseController@show
      as
      en.courses.show
      namespace
      prefix
      /en
      where
      file
      app/Http/Controllers/CourseController.php:17-35
      7 statements were executed7.98ms
      • select * from `courses` where `slug_en` = 'information-extraction-from-free-text-data-in-health' limit 1
        6.42ms/app/Http/Controllers/CourseController.php:20corspedia
        Metadata
        Bindings
        • 0. information-extraction-from-free-text-data-in-health
        Backtrace
        • 17. /app/Http/Controllers/CourseController.php:20
        • 18. /vendor/laravel/framework/src/Illuminate/Routing/Controller.php:54
        • 19. /vendor/laravel/framework/src/Illuminate/Routing/ControllerDispatcher.php:43
        • 20. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:260
        • 21. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:205
      • update `courses` set `visitors` = `visitors` + 1, `courses`.`updated_at` = '2025-06-11 06:07:34' where `id` = 580
        570μs/app/Http/Controllers/CourseController.php:21corspedia
        Metadata
        Bindings
        • 0. 2025-06-11 06:07:34
        • 1. 580
        Backtrace
        • 17. /app/Http/Controllers/CourseController.php:21
        • 18. /vendor/laravel/framework/src/Illuminate/Routing/Controller.php:54
        • 19. /vendor/laravel/framework/src/Illuminate/Routing/ControllerDispatcher.php:43
        • 20. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:260
        • 21. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:205
      • select `id`, `name_en`, `name_ar`, `topic_id`, `slug_en`, `slug_ar` from `subjects` where `subjects`.`id` in (4)
        200μs/app/Http/Controllers/CourseController.php:23corspedia
        Metadata
        Backtrace
        • 20. /app/Http/Controllers/CourseController.php:23
        • 21. /vendor/laravel/framework/src/Illuminate/Routing/Controller.php:54
        • 22. /vendor/laravel/framework/src/Illuminate/Routing/ControllerDispatcher.php:43
        • 23. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:260
        • 24. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:205
      • select `id`, `name_en`, `name_ar`, `slug_en`, `slug_ar` from `topics` where `topics`.`id` in (1)
        190μs/app/Http/Controllers/CourseController.php:23corspedia
        Metadata
        Backtrace
        • 25. /app/Http/Controllers/CourseController.php:23
        • 26. /vendor/laravel/framework/src/Illuminate/Routing/Controller.php:54
        • 27. /vendor/laravel/framework/src/Illuminate/Routing/ControllerDispatcher.php:43
        • 28. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:260
        • 29. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:205
      • select * from `institutions` where `institutions`.`id` in (11) and `institutions`.`deleted_at` is null
        200μs/app/Http/Controllers/CourseController.php:23corspedia
        Metadata
        Backtrace
        • 20. /app/Http/Controllers/CourseController.php:23
        • 21. /vendor/laravel/framework/src/Illuminate/Routing/Controller.php:54
        • 22. /vendor/laravel/framework/src/Illuminate/Routing/ControllerDispatcher.php:43
        • 23. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:260
        • 24. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:205
      • select * from `providers` where `providers`.`id` in (2) and `providers`.`deleted_at` is null
        180μs/app/Http/Controllers/CourseController.php:23corspedia
        Metadata
        Backtrace
        • 20. /app/Http/Controllers/CourseController.php:23
        • 21. /vendor/laravel/framework/src/Illuminate/Routing/Controller.php:54
        • 22. /vendor/laravel/framework/src/Illuminate/Routing/ControllerDispatcher.php:43
        • 23. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:260
        • 24. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:205
      • select * from `html_files` where `html_files`.`id` = 574 limit 1
        220μs/app/Models/Course.php:84corspedia
        Metadata
        Bindings
        • 0. 574
        Backtrace
        • 21. /app/Models/Course.php:84
        • 28. view::public.courses.show:29
        • 30. /vendor/laravel/framework/src/Illuminate/Filesystem/Filesystem.php:125
        • 31. /vendor/laravel/framework/src/Illuminate/View/Engines/PhpEngine.php:58
        • 32. /vendor/laravel/framework/src/Illuminate/View/Engines/CompilerEngine.php:72
      App\Models\HtmlFile
      1
      App\Models\Provider
      1
      App\Models\Institution
      1
      App\Models\Topic
      1
      App\Models\Subject
      1
      App\Models\Course
      1
        _token
        nHJmH3RLAsUC2Q5nvaeE9iQhJDyd4ATLRuDu1gvD
        locale
        en
        _previous
        array:1 [ "url" => "https://www.corspedia.com/en/courses/information-extraction-from-free-text-dat...
        _flash
        array:2 [ "old" => [] "new" => [] ]
        PHPDEBUGBAR_STACK_DATA
        []
        path_info
        /en/courses/information-extraction-from-free-text-data-in-health
        status_code
        200
        
        status_text
        OK
        format
        html
        content_type
        text/html; charset=UTF-8
        request_query
        []
        
        request_request
        []
        
        request_headers
        0 of 0
        array:24 [ "cf-ipcountry" => array:1 [ 0 => "US" ] "cf-connecting-ip" => array:1 [ 0 => "216.73.216.84" ] "cdn-loop" => array:1 [ 0 => "cloudflare; loops=1" ] "x-forwarded-proto" => array:1 [ 0 => "https" ] "x-forwarded-for" => array:1 [ 0 => "216.73.216.84" ] "sec-fetch-site" => array:1 [ 0 => "none" ] "accept" => array:1 [ 0 => "text/html,application/xhtml+xml,application/xml;q=0.9,image/avif,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3;q=0.7" ] "user-agent" => array:1 [ 0 => "Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com)" ] "upgrade-insecure-requests" => array:1 [ 0 => "1" ] "sec-ch-ua-platform" => array:1 [ 0 => ""Windows"" ] "sec-ch-ua-mobile" => array:1 [ 0 => "?0" ] "sec-ch-ua" => array:1 [ 0 => ""Chromium";v="130", "HeadlessChrome";v="130", "Not?A_Brand";v="99"" ] "cache-control" => array:1 [ 0 => "no-cache" ] "pragma" => array:1 [ 0 => "no-cache" ] "sec-fetch-dest" => array:1 [ 0 => "document" ] "cf-ray" => array:1 [ 0 => "94deedccfc488737-ORD" ] "accept-encoding" => array:1 [ 0 => "gzip, br" ] "priority" => array:1 [ 0 => "u=0, i" ] "sec-fetch-user" => array:1 [ 0 => "?1" ] "sec-fetch-mode" => array:1 [ 0 => "navigate" ] "cf-visitor" => array:1 [ 0 => "{"scheme":"https"}" ] "host" => array:1 [ 0 => "www.corspedia.com" ] "content-length" => array:1 [ 0 => "" ] "content-type" => array:1 [ 0 => "" ] ]
        request_server
        0 of 0
        array:50 [ "USER" => "www-data" "HOME" => "/var/www" "HTTP_CF_IPCOUNTRY" => "US" "HTTP_CF_CONNECTING_IP" => "216.73.216.84" "HTTP_CDN_LOOP" => "cloudflare; loops=1" "HTTP_X_FORWARDED_PROTO" => "https" "HTTP_X_FORWARDED_FOR" => "216.73.216.84" "HTTP_SEC_FETCH_SITE" => "none" "HTTP_ACCEPT" => "text/html,application/xhtml+xml,application/xml;q=0.9,image/avif,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3;q=0.7" "HTTP_USER_AGENT" => "Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com)" "HTTP_UPGRADE_INSECURE_REQUESTS" => "1" "HTTP_SEC_CH_UA_PLATFORM" => ""Windows"" "HTTP_SEC_CH_UA_MOBILE" => "?0" "HTTP_SEC_CH_UA" => ""Chromium";v="130", "HeadlessChrome";v="130", "Not?A_Brand";v="99"" "HTTP_CACHE_CONTROL" => "no-cache" "HTTP_PRAGMA" => "no-cache" "HTTP_SEC_FETCH_DEST" => "document" "HTTP_CF_RAY" => "94deedccfc488737-ORD" "HTTP_ACCEPT_ENCODING" => "gzip, br" "HTTP_PRIORITY" => "u=0, i" "HTTP_SEC_FETCH_USER" => "?1" "HTTP_SEC_FETCH_MODE" => "navigate" "HTTP_CF_VISITOR" => "{"scheme":"https"}" "HTTP_HOST" => "www.corspedia.com" "REDIRECT_STATUS" => "200" "SERVER_NAME" => "corspedia.com" "SERVER_PORT" => "443" "SERVER_ADDR" => "141.95.147.152" "REMOTE_USER" => "" "REMOTE_PORT" => "42098" "REMOTE_ADDR" => "172.70.130.171" "SERVER_SOFTWARE" => "nginx/1.18.0" "GATEWAY_INTERFACE" => "CGI/1.1" "HTTPS" => "on" "REQUEST_SCHEME" => "https" "SERVER_PROTOCOL" => "HTTP/2.0" "DOCUMENT_ROOT" => "/var/www/corspedia/public" "DOCUMENT_URI" => "/index.php" "REQUEST_URI" => "/en/courses/information-extraction-from-free-text-data-in-health" "SCRIPT_NAME" => "/index.php" "CONTENT_LENGTH" => "" "CONTENT_TYPE" => "" "REQUEST_METHOD" => "GET" "QUERY_STRING" => "" "SCRIPT_FILENAME" => "/var/www/corspedia/public/index.php" "PATH_INFO" => "" "FCGI_ROLE" => "RESPONDER" "PHP_SELF" => "/index.php" "REQUEST_TIME_FLOAT" => 1749622054.1921 "REQUEST_TIME" => 1749622054 ]
        request_cookies
        []
        
        response_headers
        0 of 0
        array:5 [ "content-type" => array:1 [ 0 => "text/html; charset=UTF-8" ] "cache-control" => array:1 [ 0 => "no-cache, private" ] "date" => array:1 [ 0 => "Wed, 11 Jun 2025 06:07:34 GMT" ] "set-cookie" => array:2 [ 0 => "XSRF-TOKEN=eyJpdiI6IlFIZ1hYNFRXay93T0J1Tk1HZEpEa1E9PSIsInZhbHVlIjoiTXd2RTNXQ3BtZGVZenBXZlh1YTlURkF1cng3RzRib1poWUt5TFVOMmdyMTZoTGlFbUdtMVFuL0pZV2FyQUt4ZFRMa2Q5VmFDaXZ4bFhuKzY0bS9PNzVaOHd5WTZsSHl5NWl5MVJuS3o4N3ZrQ0NPM0pPdTdEclhMd3l5VDdGWXEiLCJtYWMiOiJlMjNkNjU2ODgwZWU1NmFiYWIwOGY5M2E4NzYwMWM0YjhlODQ5MjgyZjM2MDliNWNmZjQyNzQ1MGY5OTk1ZjUwIiwidGFnIjoiIn0%3D; expires=Wed, 11 Jun 2025 08:07:35 GMT; Max-Age=7200; path=/; samesite=laxXSRF-TOKEN=eyJpdiI6IlFIZ1hYNFRXay93T0J1Tk1HZEpEa1E9PSIsInZhbHVlIjoiTXd2RTNXQ3BtZGVZenBXZlh1YTlURkF1cng3RzRib1poWUt5TFVOMmdyMTZoTGlFbUdtMVFuL0pZV2FyQUt4ZFRMa2Q5V" 1 => "laravel_session=eyJpdiI6ImFwYkFLelJ2TkZvWko4MzdNSTd5b0E9PSIsInZhbHVlIjoiZ2RRTUdOSHFhWjltcFpUR2Z3bWhMeDg5VlhJM3ZwYTg0QWY5OE5Rb3hnQUxHTmpTQm55TC9zVDcrN0xSOHJtMVBHOHBGNGNFajlTbS9hQUVaMURiTzU4Nml4aEVkcisrZ211UkhWN0Jhd3Rsb3JWVHNLZUh5WTZxdDlNQlVwUDUiLCJtYWMiOiJhODMxYjllOWVlMDdhZDJkZTFmZmJjNjJmM2M0NWRiZTE1Nzk1MGFlODk4NmYyYmY4N2VhZThmNTA1YTg0YzA2IiwidGFnIjoiIn0%3D; expires=Wed, 11 Jun 2025 08:07:35 GMT; Max-Age=7200; path=/; httponly; samesite=laxlaravel_session=eyJpdiI6ImFwYkFLelJ2TkZvWko4MzdNSTd5b0E9PSIsInZhbHVlIjoiZ2RRTUdOSHFhWjltcFpUR2Z3bWhMeDg5VlhJM3ZwYTg0QWY5OE5Rb3hnQUxHTmpTQm55TC9zVDcrN0xSOHJtMVBH" ] "Set-Cookie" => array:2 [ 0 => "XSRF-TOKEN=eyJpdiI6IlFIZ1hYNFRXay93T0J1Tk1HZEpEa1E9PSIsInZhbHVlIjoiTXd2RTNXQ3BtZGVZenBXZlh1YTlURkF1cng3RzRib1poWUt5TFVOMmdyMTZoTGlFbUdtMVFuL0pZV2FyQUt4ZFRMa2Q5VmFDaXZ4bFhuKzY0bS9PNzVaOHd5WTZsSHl5NWl5MVJuS3o4N3ZrQ0NPM0pPdTdEclhMd3l5VDdGWXEiLCJtYWMiOiJlMjNkNjU2ODgwZWU1NmFiYWIwOGY5M2E4NzYwMWM0YjhlODQ5MjgyZjM2MDliNWNmZjQyNzQ1MGY5OTk1ZjUwIiwidGFnIjoiIn0%3D; expires=Wed, 11-Jun-2025 08:07:35 GMT; path=/XSRF-TOKEN=eyJpdiI6IlFIZ1hYNFRXay93T0J1Tk1HZEpEa1E9PSIsInZhbHVlIjoiTXd2RTNXQ3BtZGVZenBXZlh1YTlURkF1cng3RzRib1poWUt5TFVOMmdyMTZoTGlFbUdtMVFuL0pZV2FyQUt4ZFRMa2Q5V" 1 => "laravel_session=eyJpdiI6ImFwYkFLelJ2TkZvWko4MzdNSTd5b0E9PSIsInZhbHVlIjoiZ2RRTUdOSHFhWjltcFpUR2Z3bWhMeDg5VlhJM3ZwYTg0QWY5OE5Rb3hnQUxHTmpTQm55TC9zVDcrN0xSOHJtMVBHOHBGNGNFajlTbS9hQUVaMURiTzU4Nml4aEVkcisrZ211UkhWN0Jhd3Rsb3JWVHNLZUh5WTZxdDlNQlVwUDUiLCJtYWMiOiJhODMxYjllOWVlMDdhZDJkZTFmZmJjNjJmM2M0NWRiZTE1Nzk1MGFlODk4NmYyYmY4N2VhZThmNTA1YTg0YzA2IiwidGFnIjoiIn0%3D; expires=Wed, 11-Jun-2025 08:07:35 GMT; path=/; httponlylaravel_session=eyJpdiI6ImFwYkFLelJ2TkZvWko4MzdNSTd5b0E9PSIsInZhbHVlIjoiZ2RRTUdOSHFhWjltcFpUR2Z3bWhMeDg5VlhJM3ZwYTg0QWY5OE5Rb3hnQUxHTmpTQm55TC9zVDcrN0xSOHJtMVBH" ] ]
        session_attributes
        0 of 0
        array:5 [ "_token" => "nHJmH3RLAsUC2Q5nvaeE9iQhJDyd4ATLRuDu1gvD" "locale" => "en" "_previous" => array:1 [ "url" => "https://www.corspedia.com/en/courses/information-extraction-from-free-text-data-in-health" ] "_flash" => array:2 [ "old" => [] "new" => [] ] "PHPDEBUGBAR_STACK_DATA" => [] ]