Author: Divyesh Vitthal Jawkhede

Divyesh Vitthal Jawkhede
27 POSTS0 COMMENTS
Divyesh is a consulting intern at Marktechpost. He is pursuing a BTech in Agricultural and Food Engineering from the Indian Institute of Technology, Kharagpur. He is a Data Science and Machine learning enthusiast who wants to integrate these leading technologies into the agricultural domain and solve challenges.

Baichuan-Omni: An Open-Source 7B Multimodal Large Language Model for Image, Video, Audio, and Text Processing

Recent advancements in Large Language Models (LLMs) have reshaped the Artificial intelligence (AI)landscape, paving the way for the creation of Multimodal Large Language Models...

Google DeepMind Introduces DeepMind Control Vision Benchmark (DMC-VB): A Dataset and Benchmark to Evaluate the Robustness of Offline Reinforcement Learning Agents to Visual Distractors

Reinforcement learning (RL)  provides a framework for learning behaviors for control and making decisions (known as policies) that help the model earn the most...

MIBench: A Comprehensive AI Benchmark for Model Inversion Attack and Defense

A Model Inversion (MI) attack is a type of privacy attack on machine learning and deep learning models, where an attacker tries to invert...

Are LLMs Failing to Match with Suffix in Fill-in-the-Middle (FIM) Code Completion? Horizon-Length Prediction: A New AI Training Task to Advance FIM by Teaching...

While writing the code for any program or algorithm, developers can struggle to fill gaps in incomplete code and often make mistakes while trying...

Dynamic Contrastive Decoding (DCD): A New AI Approach that Selectively Removes Unreliable Logits to Improve Answer Accuracy in Large Vision-Language Models

Large Vision-Language Models (LVLMs) have demonstrated impressive capabilities for capturing and reasoning over multimodal inputs and can process both images and text. While LVLM...