Author: Sana Hassan

470 posts · 0 comments
Sana Hassan, a consulting intern at Marktechpost and dual-degree student at IIT Madras, is passionate about applying technology and AI to address real-world challenges. With a keen interest in solving practical problems, she brings a fresh perspective to the intersection of AI and real-life solutions.

New AI Research Releases SWIM-IR: A Large-Scale Synthetic Multilingual Retrieval Dataset with 28 Million Training Pairs Across 33 Languages

Researchers from Google Research, Google DeepMind, and the University of Waterloo introduce SWIM-IR, a synthetic retrieval training dataset encompassing 33 languages, addressing the challenge...

Researchers from SJTU, China, Introduce TransLO: A Window-Based Masked Point Transformer Framework for Large-Scale LiDAR Odometry

Researchers from Shanghai Jiao Tong University and China University of Mining and Technology have developed TransLO. This LiDAR odometry network integrates a window-based masked...

Researchers from NTU Singapore Propose OtterHD-8B: An Innovative Multimodal AI Model Evolved from Fuyu-8B

Researchers from S-Lab, Nanyang Technological University, Singapore, introduce OtterHD-8B, an innovative multimodal model derived from Fuyu-8B, tailored to interpret high-resolution visual inputs precisely. Unlike...

This AI Paper from Google DeepMind Studies the Gap Between Pretraining Data Composition and In-Context Learning in Pretrained Transformers

Researchers from Google DeepMind explore the in-context learning (ICL) capabilities of large language models, specifically transformers, trained on diverse task families. However, their study...

Johannes Kepler University Researchers Introduce GateLoop: Advancing Sequence Modeling with Linear Recurrence and Data-Controlled State Transitions

A researcher from Johannes Kepler University has introduced GateLoop, a novel sequence model that leverages the potential of linear recurrence for efficient long-sequence modeling....
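The core idea the teaser describes, linear recurrence with data-controlled state transitions, can be sketched in a few lines. This is an illustrative toy, not GateLoop's actual model: the real work operates on vector states with learned, content-dependent gates, whereas here the gates and inputs are just given sequences.

```python
def gated_linear_recurrence(gates, inputs):
    """Toy sketch of a data-controlled linear recurrence:
        h_t = a_t * h_{t-1} + b_t
    where the transition gate a_t and input b_t vary per step.
    Illustrative only; GateLoop's actual formulation is richer."""
    h = 0.0
    states = []
    for a_t, b_t in zip(gates, inputs):
        h = a_t * h + b_t  # gate controls how much past state is kept
        states.append(h)
    return states

# With gates fixed at 1.0 the recurrence reduces to a running sum;
# gates of 0.0 discard all history and just pass inputs through.
print(gated_linear_recurrence([1.0, 1.0, 1.0], [1.0, 2.0, 3.0]))
```

Because each step is a linear map of the previous state, such recurrences can be evaluated with parallel scan algorithms, which is what makes them attractive for efficient long-sequence modeling.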

Koe AI Unveils LLVC: A Groundbreaking Real-Time Voice Conversion Model with Unparalleled Efficiency and Speed

A team of researchers from Koe AI introduced LLVC (Low-latency, Low-resource Voice Conversion), a model designed for real-time any-to-one voice conversion, characterized by ultra-low...

This AI Paper Introduces a Comprehensive Analysis of GPT-4V’s Performance in Medical Visual Question Answering: Insights and Limitations

A team of researchers from Lehigh University, Massachusetts General Hospital, and Harvard Medical School recently performed a thorough evaluation of GPT-4V, a state-of-the-art multimodal...

This AI Paper Has Moves: How Language Models Groove into Offline Reinforcement Learning with ‘LaMo’ Dance Steps and Few-Shot Learning

Researchers introduce Language Models for Motion Control (LaMo), a framework using Large Language Models (LLMs) for offline reinforcement learning. It leverages pre-trained LLMs to...

Researchers from Rice University and AWS Introduce GEMINI: Pioneering Fast Failure Recovery in Large-Scale Deep Learning Training

A team of researchers from Rice University and Amazon Web Services has developed a distributed training system called GEMINI, which aims to improve failure...

Assessing the Linguistic Mastery of Artificial Intelligence: A Deep Dive into ChatGPT’s Morphological Skills Across Languages

Researchers rigorously examine ChatGPT's morphological abilities across four languages (English, German, Tamil, and Turkish). ChatGPT falls short compared to specialized systems, especially in English....

Unlocking Intent Alignment in Smaller Language Models: A Comprehensive Guide to Zephyr-7B’s Breakthrough with Distilled Supervised Fine-Tuning and AI Feedback

ZEPHYR-7B is a smaller language model optimized for user intent alignment through distilled direct preference optimization (dDPO) using AI Feedback (AIF) data. This approach notably...
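The "direct preference optimization" that dDPO distills has a compact per-example objective. The sketch below follows the standard DPO loss (a log-sigmoid over policy-vs-reference log-ratio margins); it is a generic illustration, not Zephyr's training code, and `beta` is the usual DPO temperature hyperparameter.

```python
import math

def dpo_loss(logp_chosen, logp_rejected,
             ref_logp_chosen, ref_logp_rejected, beta=0.1):
    """Standard DPO objective on one preference pair (sketch).
    logp_* are the policy's sequence log-probs; ref_logp_* come from
    the frozen reference model. Loss = -log sigmoid(beta * margin)."""
    margin = beta * ((logp_chosen - ref_logp_chosen)
                     - (logp_rejected - ref_logp_rejected))
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# When policy and reference agree exactly, margin is 0 and the
# loss is log(2); raising the chosen response's log-prob lowers it.
print(dpo_loss(0.0, 0.0, 0.0, 0.0))
```

In the distilled variant, the preference pairs are scored by AI feedback rather than human annotators, but the objective itself is unchanged.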

Researchers from Meta and UNC-Chapel Hill Introduce Branch-Solve-Merge: A Revolutionary Program Enhancing Large Language Models’ Performance in Complex Language Tasks

BRANCH-SOLVE-MERGE (BSM) is a program for enhancing Large Language Models (LLMs) in complex natural language tasks. BSM includes branching, solving, and merging modules to...
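The three-module structure the teaser names maps onto a simple control flow. The sketch below is only an illustration of that decomposition, with placeholder callables standing in for what would be separate LLM prompts in the actual BSM program.

```python
def branch_solve_merge(task, branch, solve, merge):
    """Illustrative BSM control flow: 'branch' decomposes the task
    into sub-problems, 'solve' handles each one independently, and
    'merge' fuses the partial answers. In the paper each module is
    an LLM call; here they are placeholder functions."""
    subtasks = branch(task)          # decompose into sub-problems
    partials = [solve(t) for t in subtasks]  # solve each separately
    return merge(partials)           # combine partial solutions

# Toy usage: split a string into words, "solve" each by uppercasing,
# then merge the pieces back together.
print(branch_solve_merge("hello world", str.split, str.upper, " ".join))
```

The point of the decomposition is that each sub-problem fits the model's context and attention better than the monolithic task, and the merge step enforces a coherent final answer.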