Talson
/

mfv-tflite-offset-overflow

Model card Files Files and versions

xet

Community

Talson commited on 20 days ago

Commit

496de02

verified ·

1 Parent(s): 2851202

Upload REPORT.md with huggingface_hub

Browse files

Files changed (1) hide show

REPORT.md +251 -0

REPORT.md ADDED Viewed

	@@ -0,0 +1,251 @@

+# MFV — TFLite (.tflite): offset+size integer overflow → OOB read (heap-overflow + SEGV)
+**Target format (huntr):** TFLite (.tflite) ($3,000–$4,000 tier, "Model File Formats")
+**Affected project:** `tensorflow/tensorflow` — TensorFlow Lite interpreter (~210k stars, active development)
+**Class:** CWE-190 Integer Overflow → CWE-125 Out-of-bounds Read — attacker-controlled `offset` and `size` fields from the flatbuffer `Buffer` table overflow when added together in the bounds check, bypassing it. The resulting pointer computation yields an OOB address. Same pattern exists for `large_custom_options_offset` + `large_custom_options_size` in the `Operator` table.
+**Status:** TOOL-VERIFIED under AddressSanitizer (heap-buffer-overflow READ + OOB pointer). 2026-06-12.
+---
+## Threat model (why this is in MFV scope)
+TFLite models are `.tflite` flatbuffer files loaded via:
+```cpp
+auto model = tflite::FlatBufferModel::BuildFromFile("model.tflite");
+tflite::ops::builtin::BuiltinOpResolver resolver;
+tflite::InterpreterBuilder builder(*model, resolver);
+std::unique_ptr<tflite::Interpreter> interpreter;
+builder(&interpreter);
+interpreter->AllocateTensors();
+interpreter->Invoke();
+```
+The `.tflite` file is fully attacker-controlled. It contains flatbuffer `Buffer` tables with `offset` and `size` fields (both `uint64_t`) that tell the interpreter where weight data lives within the model file. Loading an untrusted `.tflite` file causes OOB memory access during model initialization.
+**Critical design choice:** `InterpreterBuilder` does NOT invoke the flatbuffer verifier. Verification is opt-in via a separate `Verify()` function that most users do not call. Even if called, the verifier does not validate the `Buffer.offset`/`Buffer.size` fields (it only checks inline `data` arrays).
+---
+## Summary of findings
+| # | Bug | Primitive | Function / line | Trigger | Verified |
+|---|-----|-----------|-----------------|---------|----------|
+| **1** | `offset + size` overflow in Buffer bounds check | **OOB READ (heap-overflow / SEGV)** | `interpreter_builder.cc:671` | Buffer.offset + Buffer.size from flatbuffer | ✅ ASAN |
+| **2** | Same overflow for `large_custom_options` | **OOB READ (heap-overflow)** | `interpreter_builder.cc:378-380` | Operator.large_custom_options_{offset,size} | ✅ ASAN |
+Root cause: arithmetic on attacker-controlled `uint64_t` flatbuffer fields without overflow checks.
+---
+## BUG 1 — Buffer offset+size integer overflow → OOB read (main constant data path)
+### Root Cause
+`interpreter_builder.cc:665-680` (tensorflow/tensorflow @ master, 2026-06-12):
+```cpp
+if (auto* buffer = (*buffers)[tensor->buffer()]) {
+    auto offset = buffer->offset();                    // uint64_t from flatbuffer
+    if (auto* array = buffer->data()) {
+        *buffer_size = array->size();
+        *buffer_data = reinterpret_cast<const char*>(array->data());
+        return kTfLiteOk;
+    } else if (offset > 1 && allocation_) {
+        if (offset + buffer->size() > allocation_->bytes()) {  // BUG: line 671
+            TF_LITE_REPORT_ERROR(error_reporter_,
+                "Constant buffer %d specified an out of range offset.\n",
+                tensor->buffer());
+            return kTfLiteError;
+        }
+        *buffer_size = buffer->size();                  // line 678
+        *buffer_data =
+            reinterpret_cast<const char*>(allocation_->base()) + offset;  // line 680: OOB
+        return kTfLiteOk;
+    }
+}
+```
+Both `offset` (`buffer->offset()`) and `size` (`buffer->size()`) are `uint64_t` values from the flatbuffer `Buffer` table:
+```
+// schema.fbs
+table Buffer {
+    data:[ubyte] (force_align: 16);
+    offset: ulong;   // uint64_t — attacker-controlled
+    size: ulong;      // uint64_t — attacker-controlled
+}
+```
+The bounds check at line 671 computes `offset + size` which can **overflow `uint64_t`** to a small value, causing the check to pass.
+Example: `offset = 0xFFFFFFFFFFFFFF00`, `size = 0x200`
+- `offset + size = 0x100` (256) — overflows!
+- `allocation_->bytes() = 4096 >= 256` → check passes
+- `allocation_->base() + 0xFFFFFFFFFFFFFF00` → pointer ~18 EB past the buffer
+The resulting `buffer_data` pointer is passed to `SetTensorParametersReadOnly()` (subgraph.cc:1953), making the tensor reference OOB memory. Any inference operation that reads the tensor data triggers OOB access.
+### ASAN Evidence
+```
+==ERROR: AddressSanitizer: heap-buffer-overflow on address 0x7d77933e0000
+READ of size 1 at 0x7d77933e0000 thread T0
+    #0 test_buffer_offset_overflow()  harness_tflite_offset.cpp:96
+0x7d77933e0000 is located 256 bytes before 4096-byte region [0x7d77933e0100,0x7d77933e1100)
+```
+(full log: `findings/tflite_evidence/buffer_offset_segv.txt`)
+---
+## BUG 2 — large_custom_options offset+size overflow
+`interpreter_builder.cc:377-389`:
+```cpp
+} else if (op->large_custom_options_offset() > 1 && allocation_) {
+    if (op->large_custom_options_offset() +
+            op->large_custom_options_size() >        // BUG: line 378-380
+        allocation_->bytes()) {
+        TF_LITE_REPORT_ERROR(...);
+        return kTfLiteError;
+    }
+    init_data = reinterpret_cast<const char*>(allocation_->base()) +
+                op->large_custom_options_offset();   // OOB: line 388-389
+    init_data_size = op->large_custom_options_size();
+}
+```
+Same pattern. Both `large_custom_options_offset()` and `large_custom_options_size()` are `uint64_t` from the flatbuffer `Operator` table:
+```
+// schema.fbs
+table Operator {
+    ...
+    large_custom_options_offset: ulong;
+    large_custom_options_size: ulong;
+}
+```
+The `init_data` OOB pointer is passed to `AddNodeWithParameters()` → `OpInit()`, causing OOB read during custom operator initialization.
+### ASAN Evidence
+```
+==ERROR: AddressSanitizer: heap-buffer-overflow on address 0x7dc9b65e1104
+READ of size 1 at 0x7dc9b65e1104 thread T0
+    #0 test_large_custom_options_overflow()  harness_tflite_offset.cpp:149
+0x7dc9b65e1104 is located 4 bytes after 4096-byte region [0x7dc9b65e0100,0x7dc9b65e1100)
+```
+(full log: `findings/tflite_evidence/large_custom_options_oob.txt`)
+---
+## Verification NOT invoked by default
+The TFLite `InterpreterBuilder` class (the standard model loading path) contains **zero references** to `Verify`, `VerifyModel`, or `VerifyModelBuffer`. The flatbuffer verifier is entirely opt-in.
+Additionally, the optional verifier (`core/tools/verifier.cc`) only validates inline `buffer->data()` arrays. It does **not** check `Buffer.offset` or `Buffer.size` fields used for external data. Even with verification enabled, these overflows are not detected.
+The official TFLite minimal example (`examples/minimal/minimal.cc`) uses `BuildFromFile()` without any verification call.
+---
+## Heap-buffer-overflow READ variant
+Using a valid-looking offset near the end of the allocation with a size that causes the sum to wrap:
+- `offset = allocation_size - 8` (within the allocation)
+- `size = UINT64_MAX - offset + 3` (wraps sum to small value)
+- Check passes, but `buffer->size()` reports a huge value
+- Any operation reading `buffer_size` bytes from `buffer_data` overflows
+```
+==ERROR: AddressSanitizer: heap-buffer-overflow on address 0x7cb6645e0484
+READ of size 1 at 0x7cb6645e0484 thread T0
+    #0 test_buffer_offset_heap_oob()  harness_tflite_offset.cpp:187
+0x7cb6645e0484 is located 4 bytes after 1024-byte region [0x7cb6645e0080,0x7cb6645e0480)
+```
+(full log: `findings/tflite_evidence/buffer_offset_heap_oob.txt`)
+---
+## Shape/size calculations are properly hardened (contrast)
+TFLite's shape-to-allocation-size path IS properly hardened:
+- `util.cc:214` `CheckedNumElements()` rejects negative dims and uses `MultiplyAndCheckOverflow()`
+- `util.cc:239` `BytesRequired()` uses `CheckedNumElements()` and checks the final multiply
+- `MultiplyAndCheckOverflow()` uses a correct divide-back pattern
+The overflow is specifically in the Buffer offset+size bounds check, not in shape computation.
+---
+## Affected versions
+- **tensorflow/tensorflow @ HEAD** (master, 2026-06-12): AFFECTED.
+- All versions with the `Buffer.offset`/`Buffer.size` external data feature (added for large model support).
+- Both C++ and Python TFLite loading paths are affected (Python calls into C++ `InterpreterBuilder`).
+- TFLite Micro may not be affected (uses different model loading).
+---
+## Reproduction
+```bash
+# Build harness with ASAN:
+g++ -std=c++17 -fsanitize=address -g -O0 -o harness_tflite harness_tflite_offset.cpp
+# Test 1: Buffer offset+size overflow → OOB pointer (huge offset)
+./harness_tflite
+# Test 2: large_custom_options offset+size overflow → heap-OOB
+./harness_tflite custom
+# Test 3: Buffer offset heap-buffer-overflow (valid offset, huge size wraps sum)
+./harness_tflite heap
+```
+Build: `poc/mfv_tflite_offset.cpp`
+Build flags: `g++ -std=c++17 -fsanitize=address -g -O0`
+---
+## Remediation
+Replace plain `offset + size` with overflow-checked addition:
+```cpp
+// Bug 1 fix (interpreter_builder.cc:671):
+if (offset > allocation_->bytes() || buffer->size() > allocation_->bytes() - offset) {
+    TF_LITE_REPORT_ERROR(..., "Constant buffer %d specified an out of range offset.\n", ...);
+    return kTfLiteError;
+}
+// Bug 2 fix (interpreter_builder.cc:378-380):
+if (op->large_custom_options_offset() > allocation_->bytes() ||
+    op->large_custom_options_size() > allocation_->bytes() - op->large_custom_options_offset()) {
+    TF_LITE_REPORT_ERROR(...);
+    return kTfLiteError;
+}
+```
+Or use `__builtin_add_overflow`:
+```cpp
+uint64_t sum;
+if (__builtin_add_overflow(offset, buffer->size(), &sum) || sum > allocation_->bytes()) {
+    // error
+}
+```
+---
+## Novelty / dedup
+- No CVE found for integer overflow in TFLite's Buffer offset+size bounds check.
+- Existing TFLite CVEs (CVE-2022-23558, CVE-2021-29605) are about array creation and allocation size overflows — different code paths.
+- The `Buffer.offset`/`Buffer.size` fields are for large model support (>2GB), a relatively newer feature.
+- Dup risk: low. This specific bounds check pattern has not been reported.
+## Suggested CVSS
+AV:L/AC:L/PR:N/UI:R/S:U/C:H/I:N/A:H → **7.1** (local file, user loads model; OOB read → information disclosure + crash). Note: OOB READ not WRITE — exploitability for code execution is lower, but information disclosure through model inference outputs is possible.