Please provide the content you want me to translate to Chinese. I need the text within the ```Sharp``` tags. Just paste it here, and I will translate it.
SHARP, an approach to photorealistic view synthesis from a single image

原始链接: https://apple.github.io/ml-sharp/

## SHARP:从单张图像合成逼真视图 苹果的研究人员推出了SHARP,一种从*一张*图像创建逼真3D场景表示的新方法。与之前的方法不同,SHARP能够快速地——在标准GPU上不到一秒钟内——通过一次神经网络传递直接从输入照片回归3D高斯表示的参数。 由此产生的3D表示能够实时渲染高分辨率、逼真的近景视图,并保持准确的比例以进行度量相机运动。SHARP在现有技术上表现出显著的改进,LPIPS分数降低了25-34%,DISTS分数降低了21-43%,同时合成时间大幅提升了三个数量级。它还在不同数据集上表现出强大的泛化能力,在单图像视图合成领域树立了新的最先进水平。

一个新的AI模型“Sharp”(apple.github.io)可以在一秒钟内从单张照片生成逼真的3D图像,在Hacker News上引发了讨论。用户对其速度和质量印象深刻,认为它在整体真实感方面表现出色,但承认其图像修复(图像补全)能力不如SVC等模型。 讨论延伸到图像创建之外的潜在应用。想法包括从照片自动生成真实世界物体的最小多边形近似值——简化设计和测量过程——以及它在苹果Cinematic模式和空间场景功能中的潜在作用。 一些人质疑投资这种视觉AI的价值,而另一些人则强调它在模拟中的实用性,尤其是在机器人和工业自动化等领域,因为目前创建3D场景既复杂又昂贵。还有一个有趣的提问,关于使用该AI生成苹果礼品卡的图像可能造成的后果。
相关文章

原文

Lars Mescheder, Wei Dong, Shiwei Li, Xuyang Bai, Marcel Santos, Peiyun Hu, Bruno Lecouat, Mingmin Zhen,

Amaël Delaunoy, Tian Fang, Yanghai Tsin, Stephan R. Richter, Vladlen Koltun

Apple

We present SHARP, an approach to photorealistic view synthesis from a single image. Given a single photograph, SHARP regresses the parameters of a 3D Gaussian representation of the depicted scene. This is done in less than a second on a standard GPU via a single feedforward pass through a neural network. The 3D Gaussian representation produced by SHARP can then be rendered in real time, yielding high-resolution photorealistic images for nearby views. The representation is metric, with absolute scale, supporting metric camera movements. Experimental results demonstrate that SHARP delivers robust zero-shot generalization across datasets. It sets a new state of the art on multiple datasets, reducing LPIPS by 25–34% and DISTS by 21–43% versus the best prior model, while lowering the synthesis time by three orders of magnitude.

Views synthesized by SHARP

SHARP synthesizes a photorealistic 3D representation from a single photograph in less than a second. The synthesized representation supports high-resolution rendering of nearby views, with sharp details and fine structures, at more than 100 frames per second on a standard GPU. We illustrate on photographs from Unsplash.

联系我们 contact @ memedata.com