Maybe I missed this, but I was surprised they didn't mention the connection to correlation. Pearson correlation is just the cosine similarity of the two vectors after mean-centering each one, and some bivariate distributions (normal, I think?) can be re-expressed in terms of cosine similarity.
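To make the correlation point concrete, here's a rough sketch in plain Python. The function names and the sample vectors are made up for illustration; the only claim is that centering the vectors and then taking the cosine gives the Pearson correlation.

```python
import math

def cosine_similarity(x, y):
    # Dot product divided by the product of the Euclidean norms.
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)

def center(x):
    # Subtract the mean so the vector sums to zero.
    mean = sum(x) / len(x)
    return [a - mean for a in x]

def pearson(x, y):
    # Pearson correlation as the cosine similarity of the centered vectors.
    return cosine_similarity(center(x), center(y))

# Hypothetical sample data, chosen only so the numbers are easy to check by hand.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 1.0, 4.0, 3.0, 5.0]

print(cosine_similarity(x, y))  # raw cosine, about 0.96 (53/55)
print(pearson(x, y))            # cosine after centering, about 0.8
```

The two numbers differ because raw cosine similarity depends on where the origin is, while correlation doesn't: adding a constant to every entry of `x` changes the first value but not the second.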
There are also some generalizations of the cosine to higher-dimensional settings that are kind of interesting.