Search millions of research papers, visualize citation networks, and discover groundbreaking insights with our intelligent graph system
Research Papers
Citations Mapped
Graph Visualization
Discover the most cited and impactful AI research papers. Visualize interactive citation graphs, explore research connections, and navigate through paper networks across machine learning, computer vision, and natural language processing.
Kaiming He
X. Zhang
Shaoqing Ren
This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.
Nikhila Ravi
Wan-Yen Lo
Ross B. Girshick
Laura Gustafson
Chloé Rolland
+7The Segment Anything Model (SAM) is introduced: a new task, model, and dataset for image segmentation, and its zero-shot performance is impressive – often competitive with or even superior to prior fully supervised results.
Zewar Shah
Shan Zhiyong
Adnan
This project aims to provide an efficient and robust real-time emotion identification framework that makes use of paralinguistic factors such as intensity, pitch, and MFCC and employs Diffusion Map to reduce data redundancy and high dimensionality.
Yuhai Wu
This chapter presents techniques for statistical machine learning using Support Vector Machines (SVM) to recognize the patterns and classify them, predicting structured objects using SVM, k-nearest neighbor method for classification, and Naive Bayes classifiers.
Gary S. Collins
K. Moons
P. Dhiman
R. Riley
A. L. Beam
+29The development of TRIPOD+AI is described and the expanded 27 item checklist with more detailed explanation of each reporting recommendation is presented, and the TRIPOD+AI for Abstracts checklist is presented.
Arpit Patidar
Abir Chakravorty
An innovative convolutional neural network architecture aimed at addressing challenges of detection and classification of apple fruit diseases is proposed and experimentally validated, achieving a remarkable classification accuracy of 95.37%.
S. McKinney
Todor Markov
Jacob Menick
I. Sutskever
N. Keskar
+274GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs, is developed, a Transformer-based model pre-trained to predict the next token in a document which exhibits human-level performance on various professional and academic benchmarks.
Jean-Baptiste Alayrac
Jeff Donahue
Pauline Luc
Antoine Miech
Iain Barr
+22This work introduces Flamingo, a family of Visual Language Models (VLM) with this ability to bridge powerful pretrained vision-only and language-only models, handle sequences of arbitrarily interleaved visual and textual data, and seamlessly ingest images or videos as inputs.
Elaine C. Meng
Thomas D. Goddard
E. Pettersen
Gregory S. Couch
Zach J Pearson
+2New methods in the UCSF ChimeraX molecular modeling package are described that take advantage of machine‐learning structure predictions, provide likelihood‐based fitting in maps, and compute per‐residue scores to identify modeling errors.
O. Rainio
J. Teuho
R. Klén
The most common evaluation metrics used for the typical supervised ML tasks including binary, multi-class, and multi-label classification, regression, image segmentation, object detection, and information retrieval are introduced.
R. K. Sinha
Christoph Molnar. 2020. Interpretable Machine Learning: A Guide for Making Black Box Models Explainable, Lulu.com, pp. 318, ₹6690.
Noam Shazeer
Ashish Vaswani
Lukasz Kaiser
Jakob Uszkoreit
Niki Parmar
+3A new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely is proposed, which generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.