Author: Nazmi Syed

Nazmi Syed
5 POSTS0 COMMENTS
Nazmi Syed is a consulting intern at MarktechPost and is pursuing a Bachelor of Science degree at the Indian Institute of Technology (IIT) Kharagpur. She has a deep passion for Data Science and actively explores the wide-ranging applications of artificial intelligence across various industries. Fascinated by technological advancements, Nazmi is committed to understanding and implementing cutting-edge innovations in real-world contexts.

Tucano: A Series of Decoder-Transformers Natively Pre-Trained in Portuguese

Natural Language Processing (NLP) has advanced significantly with deep learning, driven by innovations like word embeddings and transformer architectures. Self-supervised learning uses vast amounts...

Kinetix: An Open-Ended Universe of Physics-based Tasks for Reinforcement Learning

Self-supervised learning on offline datasets has permitted large models to reach remarkable capabilities both in text and image domains. Still, analogous generalizations for agents...

CMU Researchers Propose OpenFLAME: A Federated and Decentralized Localization Service

Maps are extensively used nowadays and are helpful in numerous location-based applications, including navigation, ride-sharing, fitness tracking, gaming, robotics, and augmented reality. As indoor...

Kwai-STaR: An AI Framework that Transforms LLMs into State-Transition Reasoners to Improve Their Intuitive Reasoning Capabilities

Large Language Models find it challenging to understand Mathematical reasoning. Mathematical reasoning involves various cognitive tasks like understanding and manipulating mathematical concepts, solving problems,...

Quantum Tunneling Meets AI: How Deep Neural Networks are Transforming Optical Applications

The quantum tunneling (QT) effect discovered in the 1920s was a major achievement in the field of quantum mechanics. Since there is a major...

DELTA: A Novel AI Method that Efficiently (10x Faster) Tracks Every Pixel in 3D Space from Monocular Videos

Tracking dense 3D motion from monocular videos remains challenging, particularly when aiming for pixel-level precision over long sequences. Existing methods face challenges in achieving...

CHESTNUT: A QoS Dataset for Mobile Edge Environments

Quality of Service (QoS) is a very important metric used to evaluate the performance of network services in mobile edge environments where mobile devices...

ConceptDrift: An AI Method to Identify Biases Using a Weight-Space Approach Moving Beyond Traditional Data-Restricted Protocols

Datasets and pre-trained models come with intrinsic biases. Most methods rely on spotting them by analyzing misclassified samples in a semi-automated human computer validation....

LongAlign: A Segment-Level Encoding Method to Enhance Long-Text to Image Generation

The rapid progress of text-to-image (T2I) diffusion models has made it possible to generate highly detailed and accurate images from text inputs. However, as...

Latent Action Pretraining for General Action models (LAPA): An Unsupervised Method for Pretraining Vision-Language-Action (VLA) Models without Ground-Truth Robot Action Labels

Vision-Language-Action Models (VLA) for robotics are trained by combining large language models with vision encoders and then fine-tuning them on various robot datasets; this...

CodeMMLU: A Comprehensive Multi-Choice Benchmark for Assessing Code Understanding in Large Language Models

Code Large Language Models (CodeLLMs) have predominantly focused on open-ended code generation tasks, often neglecting the critical aspect of code understanding and comprehension. Traditional...

LOONG: A New Autoregressive LLM-based Video Generator That can Generate Minute-Long Videos

Video Generation by LLMs is an emerging field with a promising growth trajectory. While Autoregressive Large Language Models (LLMs) have excelled in generating coherent...