We propose a cross-modality manifold alignment procedure that leverages triplet loss to jointly learn consistent, multi-modal embeddings of language-based concepts of real-world items. Our approach learns these embeddings by sampling triples of anchor, positive, and negative data points from RGB-depth images and their natural language descriptions. We show that our approach can benefit from, but does not require, post-processing steps such as Procrustes analysis, in contrast to some of our baselines which require it for reasonable performance. We demonstrate the effectiveness of our approach on two datasets commonly used to develop robotic-based grounded language learning systems, where our approach outperforms four baselines, including a state-of-the-art approach, across five evaluation metrics.
more »
« less
Orthogonal Procrustes and norm-dependent optimality
This note revisits the classical orthogonal Procrustes problem and investigates the norm-dependent geometric behavior underlying Procrustes alignment for subspaces. It presents generic, deterministic bounds quantifying the performance of a specified Procrustes-based choice of subspace alignment. Numerical examples illustrate the theoretical observations and offer additional, empirical findings which are discussed in detail. This note complements recent advances in statistics involving Procrustean matrix perturbation decompositions and eigenvector estimation.
more »
« less
- Award ID(s):
- 1902755
- PAR ID:
- 10303825
- Date Published:
- Journal Name:
- The Electronic Journal of Linear Algebra
- Volume:
- 36
- Issue:
- 36
- ISSN:
- 1081-3810
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
The foot plays a prominent role in weight-bearing suggesting it may reflect locomotor variation. Despite the immense amount of foot research, the calcaneus has been relatively understudied. Here we analyzed the entire calcaneal shape of Gorilla gorilla gorilla (n=41), Gorilla beringei graueri (n=17) and Gorilla beringei beringei (n=8) to understand how morphology relates to locomotor behavior. Calcanei were surface scanned and external shape analyzed using a three-dimensional geometric morphometric sliding semilandmark analysis. Semilandmarks were slid to minimize the bending energy of the thin plate spline interpolation function relative to the updated Procrustes average. Generalized Procrustes Analysis was used to align landmark configurations and shape variation was summarized using a principal components analysis. Procrustes distances between species were calculated and resampling statistics were run to test for group differences. All subspecies demonstrate statistically different morphologies (p<0.005 for pairwise comparisons). G. b. graueri separates from other subspecies based on posterolateral morphology, with G. b. graueri demonstrating an elongated peroneal trochlea, and thus more bone superiorly than G. g. gorilla. Compared to G. b. beringei, G. b. graueri has less bone inferiorly near the tuberosity. Cuboid and posterior talar facet shapes correlate with arboreality. G. b. beringei (most terrestrial) has a flatter cuboid facet and a more transversely oriented/relatively smaller posterior talar facet than G. g. gorilla (most arboreal) and G. b. graueri represents an intermediate morphology. These differences demonstrate a relationship between calcaneal shape and locomotor behavior and suggest that G. b. graueri may load its foot differently from the other subspecies. This project was supported by NSF grant # BCS - 1824630.more » « less
-
In this note we establish hypocoercivity and exponential relaxation to the Maxwellian for a class of kinetic Fokker-Planck-Alignment equations arising in the studies of collective behavior. Unlike previously known results in this direction that focus on convergence near Maxwellian, our result is global for hydrodynamically dense flocks, which has several consequences. In particular, if communication is long-range, the convergence is unconditional. If communication is local then all nearly aligned flocks quantified by smallness of the Fisher information relax to the Maxwellian. In the latter case the class of initial data is stable under the vanishing noise limit, i.e. it reduces to a non-trivial and natural class of traveling wave solutions to the noiseless Vlasov-Alignment equation.The main novelty in our approach is the adaptation of a mollified Favre filtration of the macroscopic momentum into the communication protocol. Such filtration has been used previously in large eddy simulations of compressible turbulence and its new variant appeared in the proof of the Onsager conjecture for inhomogeneous Navier-Stokes system. A rigorous treatment of well-posedness for smooth solutions is provided. Lastly, we prove that in the limit of strong noise and local alignment solutions to the Fokker-Planck-Alignment equation Maxwellialize to solutions of the macroscopic hydrodynamic system with the isothermal pressure.more » « less
-
Diversification of animal vocalizations plays a key role in behavioral evolution and speciation. Vocal organ morphology represents an important source of acoustic variation, yet its small size, complex shape, and absence of homologous landmarks pose major challenges to comparative analyses. Here, we use a geometric morphometric approach based on geometrically homologous landmarks to quantify shape variation of laryngeal cartilages of four rodent genera representing three families. Reconstructed cartilages of the larynx from contrast-enhanced micro-CT images were quantified by variable numbers of three-dimensional landmarks placed on structural margins and major surfaces. Landmark sets were superimposed using generalized Procrustes analysis prior to statistical analysis. Correlations among pairwise Procrustes distances were used to identify the minimum number of landmarks necessary to fully characterize shape variation. We found that the five species occupy distinct positions in morphospace, with variation explained in part by phylogeny, body size, and differences in vocal production mechanisms. Our findings provide a foundation for quantifying the contribution of vocal organ morphology to acoustic diversification.more » « less
-
Multiple recent efforts have used large-scale data and computational models to automatically detect misinformation in online news articles. Given the potential impact of misinformation on democracy, many of these efforts have also used the political ideology of these articles to better model misinformation and study political bias in such algorithms. However, almost all such efforts have used source level labels for credibility and political alignment, thereby assigning the same credibility and political alignment label to all articles from the same source (e.g., the New York Times or Breitbart). Here, we report on the impact of journalistic best practices to label individual news articles for their credibility and political alignment. We found that while source level labels are decent proxies for political alignment labeling, they are very poor proxies-almost the same as flipping a coin-for credibility ratings. Next, we study the implications of such source level labeling on downstream processes such as the development of automated misinformation detection algorithms and political fairness audits therein. We find that the automated misinformation detection and fairness algorithms can be suitably revised to support their intended goals but might require different assumptions and methods than those which are appropriate using source level labeling. The results suggest caution in generalizing recent results on misinformation detection and political bias therein. On a positive note, this work shares a new dataset of journalistic quality individually labeled articles and an approach for misinformation detection and fairness audits.more » « less
An official website of the United States government

