Intel处理器不稳定导致Oodle解压失败
Intel Processor Instability Causing Oodle Decompression Failures

原始链接: https://www.radgametools.com/oodleintel.htm

问题:Intel 处理器不稳定导致 Ooodle 数据压缩失败 一些使用 RAD 技术安装了基于虚幻引擎的游戏的客户遇到了莫名其妙的错误,特别是那些使用 Ooodle 压缩算法的错误。 这些问题主要出现在 Intel 13900K 和 14900K 处理器中,在 13700、14700 等相关型号中观察到的频率较低。报告表明,某些 BIOS 设置中的特定配置在承受密集负载时会导致系统不稳定。 这些不稳定性通过错误表现出来,包括 OODLE 数据解压缩尝试失败或启动期间崩溃。 该问题似乎是由于过度热衷的 BIOS 设置导致高时钟速率和对系统施加过高的负载而引起的,这通常在处理大量并行处理的应用程序中检测到。 虽然 OODLE 或 Unreal 中似乎不太可能存在任何技术故障,但这些错误性能指标背后的可能罪魁祸首可能在于特定英特尔芯片在特定情况下执行的错误计算。 要解决此问题,建议用户考虑修改当前的 CPU 设置。 一种流行的补救措施包括通过 Intel XTU 等工具或访问特定 BIOS 固件版本中相应的高级子菜单来降低 CPU 基本功率、短期和长期功率限制等。 建议的替代解决方案包括重置增强型睿频和多处理核心的时钟速度级别,同时激活英特尔的故障安全模式或应用特定产品手册中概述的相应电压增量。 一般来说,虽然上述措施可以稳定并恢复机器的最佳功能,但用户必须保持谨慎并避免不当的配置调整,以免发生严重的系统降级、故障或物理组件故障。 如果重新配置后持续不稳定或用户寻求额外帮助,我们鼓励他们通过既定的支持机制与相关供应商或分销商联系,以减轻进一步的复杂性。

然而,作者的回应提出了一个有趣的问题:人工智能生成的音乐、艺术、诗歌和视频等内容能否准确反映人类的创造力和独特性,或者它们仅仅是现有风格和技术的复制品,并添加了不同程度的原创性或新颖性? 与涉及个人人类输入的传统方法相比,人工智能在产生真正独特和突破性的创意输出方面有多有效? 哪些因素决定了人工智能生成的内容在多大程度上接近或超越人类的艺术表达? 是否有特定的流派、主题或媒介特别适合人工智能生成或挑战当前的能力和局限性? 人工智能在与人类一起创造原创和创新内容方面应该发挥什么作用(如果有的话)?它可以以什么方式增强或支持他们的努力? 此外,将人工智能技术纳入文化产品和服务的创造、传播和消费的长期影响和后果是什么,以及个人、社区、机构和行业如何负责任和有效地驾驭和管理这些趋势和发展?
相关文章

原文

RAD has become aware of a problem that can cause Oodle Data decompression failures, or crashes in games built with Unreal. We believe that this is a hardware problem which affects primarily Intel 13900K and 14900K processors, less likely 13700, 14700 and other related processors as well. Only a small fraction of those processors will exhibit this behavior. The problem seems to be caused by a combination of BIOS settings and the high clock rates and power usage of these processors, leading to system instability and unpredictable behavior under heavy load.

As far as we can tell, there is not any software bug in Oodle or Unreal that is causing this. Due to what seem to be overly optimistic BIOS settings, some small percentage of processors go out of their functional range of clock rate and power draw under high load, and execute instructions incorrectly. This is being seen disproportionately in Oodle Data decompression because unlike most gameplay, simulation, audio or rendering code, decompression needs to perform extra integrity checks to handle accidentally or maliciously corrupted data, and is thus likely to spot inconsistencies very soon after they occur. These decode failures then typically result in an error message.

When starting an Unreal Engine-based game, the most common failure is of this type:

DecompressShader(): Could not decompress shader (GetShaderCompressionFormat=Oodle)

However, this problem does not only affect Oodle, and machines that suffer from this instability will also exhibit failures in standard benchmark and stress test programs. Any programs which heavily use the processor on many threads may cause crashes or unpredictable behavior. There have been crashes seen in RealBench, CineBench, Prime95, Handbrake, Visual Studio, and more. This problem can also show up as a GPU error message, such as spurious "out of video memory" errors, even though it is caused by the CPU.

We do not have acccess to diagnostic processor information that would nail down the exact cause and best workaround for this problem. It seems that many motherboard/BIOS manufacturers are shipping with settings that push the processor outside its safe operating range. Because this problem appears to affect only a small fraction of processors, some users have had success with returning their processor to the manufacturer and getting a new one which doesn't exhibit the problem.

Other workarounds require using tuning utilities or modifying BIOS settings. Note that doing so incorrectly can cause damage to your system. The changes we are recommending here are, to the best of our knowledge, completely safe, but you are solely responsible for any damages or loss caused by changing these settings from their factory defaults. If you are uncomfortable or worried about using tuning utilities (even officially sanctioned ones) or changing your BIOS settings, and frequent crashes also occur in the benchmark programs mentioned previously, you should be able to return the CPU or the entire computer to the manufacturer instead.

A reportedly successful workaround for many people is to use Intel XTU and lower the Performance Core multiplier from x55 to x54 or x53. Apparently, affected titles may then crash one more time during load immediately after, but will work afterwards (we have not been able to confirm this ourselves). Using XTU is likely the quickest and easiest way to go and doesn't even require rebooting to try different settings, but you might need to reapply it after every start if you don't save the profile. (Alternatively, set the P-core multiplier in the BIOS instead.)

In the BIOS, if you have enabled any overclocking, please turn it off; do not use "AI" or "automatic" overclocking. Even if you have not explicitly enabled any overclocking, many BIOSes are doing some by default, so on affected machines you will have to find those settings and turn them off. Every BIOS has slightly different names for the settings; we cannot provide exact instructions of which settings to look for all of them. Some of these settings may be in the Advanced or Overclocking submenu of the BIOS.

  1. First look for settings to put the power limits and voltages of the processor into the Intel recommended safe ranges. You can find the correct limits for your processor at ark.intel.com. These might be:
  • "SVID behavior" → "Intel fail safe"
  • "Long duration power limit" → reduce to 125W if set higher ("Processor Base Power" on ARK)
  • "Short duration power limit" → reduce to 253W if set higher (for 13900/14900 CPUs, other CPUs have other limits! "Maximum Turbo Power" on ARK)
  • If those don't work, another thing to look for is BIOS "enhanced turbo" or "enhanced multithreading" settings. For example:
    • "ASUS MultiCore Enhancement" → disabled (not Auto)
    • "ASUS Performance Enhancement 3.0" → disabled
  • There have been reports of users finding stability by turning down the maximum clock rate. This can be done with BIOS settings or with Intel XTU. Some possibilities:
    • Turn down the maximum P core multiplier from 55X to 53X or 54X. (for example)
    • Turn down maximum turbo boost clock rate
    • Turn off or turn down "thermal velocity boost"

    Note that many motherboard/BIOS settings turn on XMP (Extreme Memory Profile) by default with unstable settings that can cause similar symptoms. Instability due to XMP is a separate issue, but if you have instability problems, you may wish to also disable XMP and see if that helps.



    Troubleshooting Update from Intel Corporation - Additional troubleshooting steps for ASUS, Gigabyte and MSI motherboards:

    First, install Intel XTU and run the AVX2 test. If the AVX2 test failure is seen, try these options:

    1. For ASUS:
      Ask customer to change BIOS settings: Advanced (F7)- SVID Behavior: Change to "Intel's Fail Safe"
      Reboot the OS and run XTU test again and if the AVX2 test can pass. Run games and see if the issue happens again.
    2. For Gigabyte:
      Solution A): In BIOS, select "ADVANCED MODE", in the Tweaker tab, locate the CPU Vcore and select "Normal" option, select "Dynamic Vcore(DVID)" option, change it from "Auto" to "+0.005V" Increase the DVID by +0.005 and reboot OS, until the game crash disappears and the system is running stable.
      Solution B): In BIOS, select "Tweaker", select "Advanced Voltage Settings", select "CPU/VRAM Settings", adjust "CPU Vcore Loadline Calibration", recommend starting from "Low" to "Medium" until system is stable.
      After implementing solution A or B, run the XTU test again and if the AVX2 test can pass. Run games and see if the issue happens again.
    3. For MSI:
      Solution A): In BIOS, select "OC", select "CPU Core Voltage Mode", select "Offset Mode", select "+(By PWM)", adjust the voltage until the system is stable, recommend not to exceed 0.025V for a single increase.
      Solution B): In BIOS, select "OC", select "DigitALL Power", change "CPU Loadline Calibration Control", recommend starting from "Mode 7" to a lower value until system is stable.
      After implementing solution A or B, run the XTU test again and if the AVX2 test can pass. Run games and see if the issue happens again.

    Note that we cannot directly support end users of games impacted by this problem, please use the official support channels of the game publisher, as well as the support channels of the processor and motherboard manufacturers.


    联系我们 contact @ memedata.com