LLMs can see and hear without any training
Code release for "LLMs can see and hear without any training" - facebookresearch/MILS
Also download the 5,000-sample test split used in Karpathy et al., "Deep visual-semantic alignments for generating image descriptions", CVPR 2015. Update the variables in paths.py to set the dataset directory and the output folder.

MILS is made available under a CC-BY-NC 4.0 license. However, third-party content pulled from other locations is subject to its own licenses, and you may have other legal obligations or restrictions that govern your use of that content.
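For the paths.py step, a minimal sketch of what such a configuration might look like follows. The variable names DATASET_DIR and OUTPUT_DIR, the placeholder paths, and the helper check_paths are assumptions made for illustration only; the actual names in the repository's paths.py may differ, so check that file before editing.

```python
from pathlib import Path

# Hypothetical variable names and placeholder paths; the real paths.py
# in the repository may use different names.
DATASET_DIR = Path("/path/to/datasets")  # where the downloaded test split lives
OUTPUT_DIR = Path("/path/to/outputs")    # where generated results are written


def check_paths(dataset_dir: Path, output_dir: Path) -> bool:
    """Create the output folder if needed and report whether the dataset folder exists.

    This helper is illustrative and not part of the MILS code release.
    """
    output_dir.mkdir(parents=True, exist_ok=True)
    return dataset_dir.is_dir()
```

Calling check_paths(DATASET_DIR, OUTPUT_DIR) before a run is one way to catch a mistyped dataset path early, rather than partway through a job.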