TLDR AI 2023-10-17
Is AGI already here β, 800B tabular token dataset π, multimodal learning without paired data π―
Improved ROCm support (2 minute read)
AMD has improved support for its RDNA GPUs (including the RX 7900) and Pytorch for more training options for AI projects.
Startup Aims to Build Hundreds of Chip Factories with Prefab Parts and AI (6 minute read)
Nanotronics is a New York-based industrial AI company that wants to build an AI-enabled chip factory that can be assembled and expanded modularly with prefab pieces. Its Cubefabs system uses AI to take away the need for the specialization normally needed in a lab and allow people who are not semiconductor experts to work at the facilities. Each facility will need only about 30 people to operate. The bulk of each facility can be flat-packed and put in a shipping container.
TabLiB 800B tabular token dataset (2 minute read)
Dataset of tabular tokens to encourage the community to build Large Data Models that better understand tabular data. It is the largest publicly available tabular dataset - a compilation of 627 million tables together with 867 billion tokens of contextual information. TabLiB is available on Hugging Face.
π§
Research & Innovation
Advanced Head Pose Estimation (16 minute read)
Figuring out the direction someone's head is turned is important for lots of tech applications. These researchers have developed a new way to estimate head positions from any angle.
Boosting LiDAR-Camera Detection (14 minute read)
The authors of this paper have created a new technique called SupFusion to make LiDAR and camera systems work together better for detecting things like cars or pedestrians.
Multimodal Learning without Paired Data (GitHub Repo)
The research introduces Ex-MCR, a novel method that efficiently learns unified contrastive representations for multiple modalities without needing paired data. By aligning existing Multi-modal Contrastive Representations, Ex-MCR achieves top performance in tasks like audio-visual retrieval and 3D object classification.
π¨βπ»
Engineering & Research
MosaicFusion: A Tool to Make New Images without Training (GitHub Repo)
MosaicFusion is like a magic tool that can create new pictures with lots of objects without needing any prior learning. It does this in two steps: first, it makes the picture, and then it creates a mask to show where each object is.
Enhanced Earth Observation (GitHub Repo)
This study introduces a new method that combines digital surface model (DSM) data and aerial images from different times to improve change detection beyond just 2D perspectives.
Libgen to txt (GitHub Repo)
Libgen is likely a dataset in many closed models. While the legality of this dataset for commercial use is under debate, researchers are still using it to better understand data quality for language model training.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 920,000 readers for
one daily email