AI Research - Blog

About Whitepaper Security Blog

AI Research

Curating Pioneering Research

U1U2U3

Join Our Community of Contributors

/ LLMLLM

Adapting Large Language Models via Reading Comprehension
Prof. Otto NomosMay 27, 2024 ∙ 1 min read
OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch
Prof. Otto NomosMay 27, 2024 ∙ 1 min read
PDFTriage: Question Answering over Long, Structured Documents
Prof. Otto NomosMay 27, 2024 ∙ 1 min read
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)
Prof. Otto NomosMay 27, 2024 ∙ 1 min read
An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models
Prof. Otto NomosMay 27, 2024 ∙ 1 min read
MindAgent: Emergent Gaming Interaction
Prof. Otto NomosMay 27, 2024 ∙ 1 min read
Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?
Prof. Otto NomosMay 25, 2024 ∙ 1 min read
Recovering from Privacy-Preserving Masking with Large Language Models
Prof. Otto NomosMay 25, 2024 ∙ 1 min read
S3-DST: Structured Open-Domain Dialogue Segmentation and State Tracking in the Era of LLMs
Prof. Otto NomosMay 25, 2024 ∙ 1 min read
Augmenting text for spoken language understanding with Large Language Models
Prof. Otto NomosMay 25, 2024 ∙ 1 min read
Language Modeling Is Compression
Prof. Otto NomosMay 25, 2024 ∙ 1 min read
Baichuan 2: Open Large-scale Language Models
Prof. Otto NomosMay 24, 2024 ∙ 1 min read
Stabilizing RLHF through Advantage Model and Selective Rehearsal
Prof. Otto NomosMay 24, 2024 ∙ 1 min read
Chain-of-Verification Reduces Hallucination in Large Language Models
Prof. Otto NomosMay 24, 2024 ∙ 1 min read
LMDX: Language Model-based Document Information Extraction and Localization
Prof. Otto NomosMay 24, 2024 ∙ 1 min read
SlimPajama-DC: Understanding Data Combinations for LLM Training
Prof. Otto NomosOct 04, 2023 ∙ 1 min read
Contrastive Decoding Improves Reasoning in Large Language Models
Prof. Otto NomosOct 04, 2023 ∙ 1 min read
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
Prof. Otto NomosOct 04, 2023 ∙ 1 min read
A Data Source for Reasoning Embodied Agents
Prof. Otto NomosOct 04, 2023 ∙ 1 min read
Leveraging Contextual Information for Effective Entity Salience Detection
Prof. Otto NomosOct 04, 2023 ∙ 1 min read
LASER: LLM Agent with State-Space Exploration for Web Navigation
Prof. Otto NomosOct 04, 2023 ∙ 1 min read
Sparse Autoencoders Find Highly Interpretable Features in Language Models
Prof. Otto NomosOct 04, 2023 ∙ 1 min read
Investigating Answerability of LLMs for Long-Form Question Answering
Prof. Otto NomosOct 04, 2023 ∙ 1 min read
Scaling Laws for Sparsely-Connected Foundation Models
Prof. Otto NomosOct 04, 2023 ∙ 1 min read
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Prof. Otto NomosOct 04, 2023 ∙ 1 min read
Ambiguity-Aware In-Context Learning with Large Language Models
Prof. Otto NomosOct 04, 2023 ∙ 1 min read
Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation?
Prof. Otto NomosOct 04, 2023 ∙ 1 min read
Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts
Prof. Otto NomosOct 04, 2023 ∙ 1 min read
Agents: An Open-source Framework for Autonomous Language Agents
Prof. Otto NomosOct 04, 2023 ∙ 1 min read
Statistical Rejection Sampling Improves Preference Optimization
Prof. Otto NomosOct 04, 2023 ∙ 1 min read
Large Language Models for Compiler Optimization
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
AstroLLaMA: Towards Specialized Foundation Models in Astronomy
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
Large Language Model for Science: A Study on P vs. NP
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
Efficient Memory Management for Large Language Model Serving with PagedAttention
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
Neurons in Large Language Models: Dead, N-gram, Positional
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
Textbooks Are All You Need II: phi-1.5 technical report
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
DrugChat: Towards Enabling ChatGPT-Like Capabilities on Drug Molecule Graphs
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
XGen-7B Technical Report
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
GPT Can Solve Mathematical Problems Without a Calculator
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
Large Language Models as Optimizers
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
Efficient RLHF: Reducing the Memory Usage of PPO
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback [Summary]
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Llama 2: Open Foundation and Fine-Tuned Chat Models [Commentary]
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Challenges and Applications of Large Language Models [Summary]
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
LoraHub: Efficient Cross-Task Generalization Via Dynamic LoRA Composition [Commentary]
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
ToolLLM: Facilitating Large Language Models To Master 16000+ Real-World APIs [Commentary]
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Efficient Guided Generation for Large Language Models
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Predicting transcriptional outcomes of novel multigene perturbations with GEARS
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
A Survey on Model Compression for Large Language Models
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
LLM As DBA
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Self-Alignment with Instruction Backtranslation
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
BioCoder: A Benchmark for Bioinformatics Code Generation with Contextual Pragmatic Knowledge
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Can Programming Languages Boost Each Other via Instruction Tuning?
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
WeatherBench 2: A benchmark for the next generation of data-driven global weather models
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
SoTaNa: The Open-Source Software Development Assistant
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Teach LLMs to Personalize -- An Approach inspired by Writing Education
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
RAVEN: In-Context Learning with Retrieval Augmented Encoder-Decoder Language Models
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
CausalLM is not optimal for in-context learning
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Platypus: Quick, Cheap, and Powerful Refinement of LLMs
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
OctoPack: Instruction Tuning Code Large Language Models
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Enhancing Network Management Using Code Generated by Large Language Models
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Improving Joint Speech-Text Representations Without Alignment
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
PIPPA: A Partially Synthetic Conversational Dataset
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Self-Alignment with Instruction Backtranslation
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
OpenProteinSet: Training data for structural biology at scale
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Accelerating LLM Inference with Staged Speculative Decoding
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Shepherd: A Critic for Language Model Generation
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
Simple synthetic data reduces sycophancy in large language models
Prof. Otto NomosOct 02, 2023 ∙ 1 min read

/ RLHFRLHF

Stabilizing RLHF through Advantage Model and Selective Rehearsal
Prof. Otto NomosMay 24, 2024 ∙ 1 min read
Statistical Rejection Sampling Improves Preference Optimization
Prof. Otto NomosOct 04, 2023 ∙ 1 min read
Efficient RLHF: Reducing the Memory Usage of PPO
Prof. Otto NomosOct 03, 2023 ∙ 1 min read
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback [Summary]
Prof. Otto NomosOct 02, 2023 ∙ 1 min read
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
Prof. Otto NomosOct 02, 2023 ∙ 1 min read

/ Reinforcement LearningReinforcement Learning

Statistical Rejection Sampling Improves Preference Optimization
Prof. Otto NomosOct 04, 2023 ∙ 1 min read