Abstract

Recent progress in physics-based character control has made it possible to learn policies from unstructured motion data. However, training a single control policy that handles diverse, unseen motions and can be deployed on real-world physical robots remains challenging. In this paper, we propose a two-stage technique that enables the control of a character from a full-body kinematic motion reference, with a focus on imitation accuracy. In the first stage, we extract a latent-space encoding by training a variational autoencoder on short windows of motion taken from unstructured data. In the second stage, we use the resulting time-varying latent code to train a conditional policy, providing a mapping from kinematic input to dynamics-aware output. Keeping the two stages separate lets us benefit from self-supervised objectives for better latent codes and from explicit imitation rewards that avoid mode collapse. We demonstrate the efficiency and robustness of our method in simulation on unseen user-specified motions, and on a bipedal robot, where we bring dynamic motions to the real world.
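To make the two-stage structure concrete, the following is a minimal sketch, assuming PyTorch and hypothetical shapes (window length `W`, per-frame feature size `D`, latent size `Z`, simulator state size `S`, action size `A`). The abstract does not specify architectures or dimensions, so every name and number below is an illustrative assumption, not the paper's implementation; the RL training loop and imitation reward are omitted.

```python
# Illustrative sketch only: all module names, layer sizes, and tensor
# dimensions are assumptions, not the paper's actual implementation.
import torch
import torch.nn as nn

W, D, Z, S, A = 8, 69, 32, 197, 28  # window, frame dim, latent, state, action (assumed)

class MotionVAE(nn.Module):
    """Stage 1: encode short windows of kinematic motion into latent codes."""
    def __init__(self):
        super().__init__()
        self.enc = nn.Sequential(nn.Flatten(), nn.Linear(W * D, 256), nn.ELU())
        self.mu, self.logvar = nn.Linear(256, Z), nn.Linear(256, Z)
        self.dec = nn.Sequential(nn.Linear(Z, 256), nn.ELU(), nn.Linear(256, W * D))

    def forward(self, window):  # window: (B, W, D)
        h = self.enc(window)
        mu, logvar = self.mu(h), self.logvar(h)
        z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()  # reparameterization trick
        recon = self.dec(z).view(-1, W, D)
        # Self-supervised ELBO: reconstruction term plus KL toward N(0, I)
        rec_loss = (recon - window).pow(2).mean()
        kl = -0.5 * (1 + logvar - mu.pow(2) - logvar.exp()).mean()
        return rec_loss + 1e-3 * kl, mu

class ConditionalPolicy(nn.Module):
    """Stage 2: map simulator state + time-varying latent code to actions."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(S + Z, 512), nn.ELU(),
                                 nn.Linear(512, 512), nn.ELU(),
                                 nn.Linear(512, A))

    def forward(self, state, z):
        return self.net(torch.cat([state, z], dim=-1))

# Stage 1: pretrain the VAE on motion windows; Stage 2: condition the policy
# on the (frozen) encoder's latent code and train it with RL against an
# explicit imitation reward (RL loop not shown).
vae = MotionVAE()
policy = ConditionalPolicy()
loss, z = vae(torch.randn(4, W, D))             # stage-1 training step (toy data)
action = policy(torch.randn(4, S), z.detach())  # stage-2 conditioning
```

The separation shown here mirrors the abstract's rationale: the VAE is optimized with a purely self-supervised objective, while the policy is optimized against imitation rewards, so neither objective has to compromise the other.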