ARXIV:2603.28003 · 3D AVATAR GENERATION · SUBMITTED 31 MAR · 20:20 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

DipGuava: Disentangling Personalized Gaussian Features for 3D Head Avatars from Monocular Video

Jeonghaeng Lee · Seok Keun Choi · Zhixuan Li · Weisi Lin · Sanghoon Lee · arXiv

Generate photorealistic, identity-preserving 3D head avatars from single videos by disentangling appearance into geometry-driven and residual detail components.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain Generate photorealistic, identity-preserving 3D head avatars from single videos by disentangling appearance into geometry-driven and residual detail components.

Evidence 52 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Generate photorealistic, identity-preserving 3D head avatars from single videos by disentangling appearance into geometry-driven and residual detail components. To fill this gap, we present DipGuava (Disentangled and Personalized Gaussian UV Avatar), a novel 3D…

METHOD

Full abstract

While recent 3D head avatar creation methods attempt to animate facial dynamics, they often fail to capture personalized details, limiting realism and expressiveness. To fill this gap, we present DipGuava (Disentangled and Personalized Gaussian UV Avatar), a novel 3D Gaussian head avatar creation method that successfully generates avatars with personalized attributes from monocular video. DipGuava is the first method to explicitly disentangle facial appearance into two complementary components, trained in a structured two-stage pipeline that significantly reduces learning ambiguity and enhances reconstruction fidelity. In the first stage, we learn a stable geometry-driven base appearance that captures global facial structure and coarse expression-dependent variations. In the second stage, the personalized residual details not captured in the first stage are predicted, including high-frequency components and nonlinearly varying features such as wrinkles and subtle skin deformations. These components are fused via dynamic appearance fusion that integrates residual details after deformation, ensuring spatial and semantic alignment. This disentangled design enables DipGuava to generate photorealistic, identity-preserving avatars, consistently outperforming prior methods in both visual quality and quantitativeperformance, as demonstrated in extensive experiments.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. This disentangled design enables DipGuava to generate photorealistic, identity-preserving avatars, consistently outperforming prior methods in both visual quality and quantitativeperformance, as demonstrated in extensive…

WHY NOW

3D Avatar Generation moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainGenerate photorealistic, identity-preserving 3D head avatars from single videos by disentangling appearance into geometry-driven and residual detail components.

Evidence52 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

Generate photorealistic, identity-preserving 3D head avatars from single videos by disentangling appearance into geometry-driven and residual detail components.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

Generate photorealistic, identity-preserving 3D head avatars from single videos by disentangling appearance into geometry-driven and residual detail components.

Segment

3D Avatar Generation

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "8a677e82-5bbd-4f96-8da7-4d42c4fdc2cb", "arxiv_id": "2603.28003", "canonical_route": "/paper/dipguava-disentangling-personalized-gaussian-features-for-3d-head-avatars-from-monocular-video", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "dipguava-disentangling-personalized-gaussian-features-for-3d-head-avatars-from-monocular-video", "endpoints": { "paper_pack": "/api/v1/paper/dipguava-disentangling-personalized-gaussian-features-for-3d-head-avatars-from-monocular-video/paper-pack", "build_passport": "/api/v1/paper/dipguava-disentangling-personalized-gaussian-features-for-3d-head-avatars-from-monocular-video/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "DipGuava: Disentangling Personalized Gaussian Features for 3D Head Avatars from Monocular Video", "normalized_query": "2603.28003", "route": "/paper/dipguava-disentangling-personalized-gaussian-features-for-3d-head-avatars-from-monocular-video", "paper_ref": "dipguava-disentangling-personalized-gaussian-features-for-3d-head-avatars-from-monocular-video", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/dipguava-disentangling-personalized-gaussian-features-for-3d-head-avatars-from-monocular-video#webpage", "url": "https://sciencetostartup.com/paper/dipguava-disentangling-personalized-gaussian-features-for-3d-head-avatars-from-monocular-video", "name": "DipGuava: Disentangling Personalized Gaussian Features for 3D Head Avatars from Monocular Video", "description": "Generate photorealistic, identity-preserving 3D head avatars from single videos by disentangling appearance into geometry-driven and residual detail components.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/dipguava-disentangling-personalized-gaussian-features-for-3d-head-avatars-from-monocular-video#scholarlyArticle", "headline": "DipGuava: Disentangling Personalized Gaussian Features for 3D Head Avatars from Monocular Video", "description": "Generate photorealistic, identity-preserving 3D head avatars from single videos by disentangling appearance into geometry-driven and residual detail components.", "url": "https://sciencetostartup.com/paper/dipguava-disentangling-personalized-gaussian-features-for-3d-head-avatars-from-monocular-video", "sameAs": "https://arxiv.org/abs/2603.28003", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.28003" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-30T03:50:23.000Z", "author": [ { "@type": "Person", "name": "Jeonghaeng Lee" }, { "@type": "Person", "name": "Seok Keun Choi" }, { "@type": "Person", "name": "Zhixuan Li" }, { "@type": "Person", "name": "Weisi Lin" }, { "@type": "Person", "name": "Sanghoon Lee" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "3D Avatar Generation" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "3D Avatar Generation", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "DipGuava: Disentangling Personalized Gaussian Features for 3", "item": "https://sciencetostartup.com/paper/dipguava-disentangling-personalized-gaussian-features-for-3d-head-avatars-from-monocular-video" } ] } ] }

Competitive landscape

Generate photorealistic, identity-preserving 3D head avatars from single videos by disentangling appearance into geometry-driven and residual detail components.

Segment

3D Avatar Generation

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

DipGuava: Disentangling Personalized Gaussian Features for 3D Head Avatars from Monocular Video

DipGuava: Disentangling Personalized Gaussian Features for 3D Head Avatars from Monocular Video

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline