Microsoft 365 Roadmap

RM381750: Microsoft Purview compliance portal: Data Loss Prevention for endpoints - Optical character recognition (OCR) support for embedded images in endpoint

Summary

This release will extend OCR support from standalone images (JPEG, JPG, PNG, BMP, TIFF, and PDF) to images embedded in the following file types: Office files (XLSX, DOCX, PPTX), container files (ZIP, RAR, 7z, and more), and PDF files. Image-only PDF files are already supported; this release adds support for hybrid PDF files that contain both images and searchable text. Updated May 20, 2026: We have paused the rollout and will resume soon. Thank you for your patience.

GA date: May CY2026

Preview date: April CY2026

Version history

3 versions tracked

Updated 2 times since Apr 2, 2026. Microsoft 365 Message Center only shows the current version; this archive preserves tracked history.

  1. May 20, 2026 - 11:15 PM (Latest - v3)

    Changed: Body, Tags, Status

  2. May 19, 2026 - 10:45 PM (v2)

    Changed: Tags, Status

  3. Apr 2, 2026 - 11:15 PM (Original - v1)

    Changed: Initial version