fix: unify media reference handling by Soulter · Pull Request #8764 · AstrBotDevs/AstrBot

Soulter · 2026-06-13T15:40:41Z

Summary

Centralize media reference materialization and base64 resolution for local paths, http(s), base64://, data URIs, and legacy bare base64 payloads.
Normalize incoming Record audio to wav and Image media to temporary jpg during preprocess, with event-scoped cleanup.
Reuse the shared media resolver across OpenAI, Gemini, Anthropic, MiMo, DeerFlow, STT, and platform media paths while sanitizing logs and cleaning temporary conversion outputs.
Ensure generated TTS audio is tracked for cleanup after the event finishes.

Fixes #8676

Tests

uv run pytest tests/test_media_utils.py tests/test_agent_runner_media_resolver.py tests/test_platform_audio_media_resolver.py tests/test_openai_source.py tests/test_deerflow_agent_runner.py tests/test_discord_adapter.py tests/test_mattermost_adapter.py tests/test_kook/test_kook_client.py tests/test_whisper_api_source.py tests/test_preprocess_stage.py tests/unit/test_astr_main_agent.py
git diff --cached --name-only -- "*.py" | xargs uv run ruff check
git diff --cached --name-only -- "*.py" | xargs uv run ruff format --check
git diff --check

sourcery-ai

Sorry @Soulter, your pull request is larger than the review limit of 150000 diff characters

gemini-code-assist

Code Review

This pull request introduces a centralized MediaResolver utility to unify and normalize media resolution, format conversion, and base64 encoding across all platform adapters, agent runners, and LLM providers. It also updates the pipeline preprocess stage to normalize media files early and adds extensive unit tests. The review feedback highlights several key improvement opportunities, including resolving redundant base64 encoding/decoding in the Coze runner, broadening overly restrictive data: URI checks in message components, preventing a potential runtime crash in the Discord adapter due to empty media references, avoiding double-resolution of images in the QQOfficial adapter, and robustly stripping internal whitespace during base64 padding calculations.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

gemini-code-assist · 2026-06-13T15:43:13Z

+            image_data = await MediaResolver(
+                image_url,
+                media_type="image",
+            ).to_base64_data(strict=True)
+            if image_data is None:
+                raise ValueError("invalid image data")
+            file_id = await self.api_client.upload_file(image_data.to_bytes())


Using to_base64_data(strict=True) followed by to_bytes() is inefficient because it performs redundant base64 encoding and decoding. You can use to_bytes() directly on the MediaResolver to retrieve the raw bytes efficiently.

image_bytes = await MediaResolver( image_url, media_type="image", ).to_bytes() file_id = await self.api_client.upload_file(image_bytes)

gemini-code-assist · 2026-06-13T15:43:13Z

            if (
-                self.file.startswith("file:///")
+                self.file.startswith("file://")
                or self.file.startswith("http")
                or self.file.startswith("base64://")
+                or self.file.startswith("data:audio/")
                or os.path.exists(self.file)
            ):
                return self.file

        # 2) 尝试 url（可能是 file:/// 或 http 链接）
        if self.url:
            if (
-                self.url.startswith("file:///")
+                self.url.startswith("file://")
                or self.url.startswith("http")
+                or self.url.startswith("data:audio/")
                or os.path.exists(self.url)
                or (
-                    self.url.startswith("file:///")
+                    self.url.startswith("file://")
                    and os.path.exists(self._decode_file_uri(self.url))
                )
            ):


Checking for data:audio/ is overly restrictive. A data URI can have other audio-related MIME types (e.g., data:audio/mpeg, data:audio/ogg) or generic ones like data:application/octet-stream. Since MediaResolver can handle any valid data URI, checking for data: is more robust and future-proof.

if ( self.file.startswith("file://") or self.file.startswith("http") or self.file.startswith("base64://") or self.file.startswith("data:") or os.path.exists(self.file) ): return self.file # 2) 尝试 url（可能是 file:/// 或 http 链接） if self.url: if ( self.url.startswith("file://") or self.url.startswith("http") or self.url.startswith("data:") or os.path.exists(self.url) or ( self.url.startswith("file://") and os.path.exists(self._decode_file_uri(self.url)) ) ):

gemini-code-assist · 2026-06-13T15:43:13Z

+            if (
+                candidate.startswith("file://")
+                or candidate.startswith("http")
+                or candidate.startswith("base64://")
+                or candidate.startswith("data:video/")
+                or os.path.exists(candidate)
+            ):


Checking for data:video/ is overly restrictive. A video data URI can have other MIME types or generic ones like data:application/octet-stream. Checking for data: is more robust and consistent with MediaResolver's capabilities.

Suggested change

if (

candidate.startswith("file://")

or candidate.startswith("http")

or candidate.startswith("base64://")

or candidate.startswith("data:video/")

or os.path.exists(candidate)

):

if (

candidate.startswith("file://")

or candidate.startswith("http")

or candidate.startswith("base64://")

or candidate.startswith("data:")

or os.path.exists(candidate)

):

gemini-code-assist · 2026-06-13T15:43:13Z

+                if not i.file:
                    raise ValueError("Unsupported image file format")
-                image_base64 = image_base64.removeprefix("base64://")
+                image_base64 = await MediaResolver(
+                    i.file,
+                    media_type="image",
+                ).to_base64()
+                if i.file.startswith("file://") or os.path.exists(i.file):
+                    image_file_path = await MediaResolver(
+                        i.file,
+                        media_type="image",
+                    ).to_path()


The current implementation resolves the same image twice (once for to_base64() and once for to_path()). If the image is a local file, we can resolve it to a path first, and then read the file directly to get the base64 data, avoiding redundant resolution lifecycles.

if not i.file: raise ValueError("Unsupported image file format") resolver = MediaResolver(i.file, media_type="image") if i.file.startswith("file://") or os.path.exists(i.file): image_file_path = await resolver.to_path() with open(image_file_path, "rb") as f: image_base64 = base64.b64encode(f.read()).decode("utf-8") else: image_base64 = await resolver.to_base64()

gemini-code-assist · 2026-06-13T15:43:13Z

+    validate: bool = False,
+) -> bytes:
+    """Decode base64 payloads while tolerating omitted padding."""
+    payload = payload.strip()


Using payload.strip() only removes leading and trailing whitespace. If the base64 payload contains internal whitespace (such as newlines from MIME base64), len(payload) % 4 can calculate the wrong padding length, leading to incorrect padding errors. Stripping all whitespace is much safer.

Suggested change

payload = payload.strip()

payload = "".join(payload.split())

…ring sanitization' Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

fix: unify media reference handling

6f21f9a

dosubot Bot added the size:XXL This PR changes 1000+ lines, ignoring generated files. label Jun 13, 2026

sourcery-ai Bot reviewed Jun 13, 2026

View reviewed changes

dosubot Bot added area:core The bug / feature is about astrbot's core, backend area:platform The bug / feature is about IM platform adapter, such as QQ, Lark, Telegram, WebChat and so on. area:provider The bug / feature is about AI Provider, Models, LLM Agent, LLM Agent Runner. labels Jun 13, 2026

github-advanced-security AI found potential problems Jun 13, 2026

View reviewed changes

Comment thread tests/test_media_utils.py Fixed

gemini-code-assist Bot reviewed Jun 13, 2026

View reviewed changes

Soulter and others added 7 commits June 13, 2026 23:57

fix: accept bare base64 record media refs

c43811e

chore: update agents.md

d3015a5

fix: unify file URI handling across media components and utilities

fa5491b

fix: unify media reference type handling with MediaRefStr alias

9e7047c

Potential fix for pull request finding 'CodeQL / Incomplete URL subst…

81432c1

…ring sanitization' Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>

Update astrbot/core/platform/sources/discord/discord_platform_adapter.py

87d1446

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

fix: unify media handling and improve base64 decoding across components

409ddff

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: unify media reference handling#8764

fix: unify media reference handling#8764
Soulter wants to merge 8 commits into
masterfrom
fix/unify-media-resolution

Soulter commented Jun 13, 2026

Uh oh!

sourcery-ai Bot left a comment

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Jun 13, 2026

Uh oh!

gemini-code-assist Bot Jun 13, 2026

Uh oh!

gemini-code-assist Bot Jun 13, 2026

Uh oh!

Uh oh!

gemini-code-assist Bot Jun 13, 2026

Uh oh!

gemini-code-assist Bot Jun 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

Soulter commented Jun 13, 2026

Summary

Tests

Uh oh!

sourcery-ai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Jun 13, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 13, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 13, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gemini-code-assist Bot Jun 13, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 13, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants