
"Fine-tuning Large Language Models for Entity Matching"

The podcast on this paper is generated with Google's Illuminate.

Innovative idea: augmenting training data with structured explanations that explicitly name the compared attributes, their importance, and their similarity, to improve LLM fine-tuning for entity matching.

📚 https://arxiv.org/pdf/2409.08185

Original Problem 💡:

Existing research on using LLMs for entity matching has focused on prompt engineering and in-context learning. This paper instead explores fine-tuning LLMs for improved performance and generalization.

-----

Solution in this Paper 🧠:

• Analyzes fine-tuning along two dimensions:

- Representation of training examples (adding LLM-generated explanations)

- Selection and generation of training examples using LLMs

• Experiments with:

- Standard fine-tuning

- Augmenting training data with textual/structured explanations

- Filtering training sets

- Generating synthetic training examples

- Error-based example selection
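As a concrete illustration of the explanation-augmented representation, a training example can embed a structured explanation in the target answer before the match decision. A minimal sketch, assuming a chat-style fine-tuning format; the JSON schema, `build_training_record` helper, and the example records are illustrative assumptions, not the paper's exact format:

```python
import json

def build_training_record(entity_a: dict, entity_b: dict,
                          label: bool, explanation: list) -> dict:
    """Build one chat-style fine-tuning record whose target answer
    embeds a structured explanation (attribute, importance, similarity)
    before the final match decision."""
    prompt = (
        "Do these two product records refer to the same entity?\n"
        f"Record A: {json.dumps(entity_a)}\n"
        f"Record B: {json.dumps(entity_b)}"
    )
    answer = json.dumps({
        "explanation": explanation,  # one entry per compared attribute
        "match": label,
    })
    return {"messages": [
        {"role": "user", "content": prompt},
        {"role": "assistant", "content": answer},
    ]}

# Hypothetical product pair and explanation entries.
record = build_training_record(
    {"title": "Canon EOS R6 Body", "price": "2499"},
    {"title": "Canon EOS R6 Mark II Body", "price": "2499"},
    label=False,
    explanation=[
        {"attribute": "title", "importance": "high", "similarity": "medium"},
        {"attribute": "price", "importance": "low", "similarity": "high"},
    ],
)
print(json.dumps(record, indent=2))
```

Putting the explanation before the decision forces the model to generate its attribute-level reasoning first, which is the representational change the paper evaluates.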

-----

Key Insights from this Paper 💡:

• Fine-tuning significantly improves smaller models' performance

• Structured explanations enhance performance and generalization

• Example selection/generation methods improve Llama 3.1 8B but decrease GPT-4o Mini performance

• Fine-tuning improves in-domain generalization but hurts cross-domain transfer

• Fine-tuning reduces prompt sensitivity
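The error-based selection mentioned above can be sketched as follows: keep only the pool examples the un-fine-tuned model currently misclassifies, so fine-tuning concentrates on failure modes. The `predict` callable stands in for a zero-shot LLM call and the pool records are assumptions for illustration:

```python
def select_error_examples(pool, predict):
    """Return the subset of candidate examples the zero-shot
    predictor labels incorrectly."""
    return [ex for ex in pool if predict(ex) != ex["label"]]

# Hypothetical candidate pool; a real predict() would query the LLM.
pool = [
    {"pair": ("iPhone 13", "iPhone 13 128GB"), "label": True},
    {"pair": ("iPhone 13", "iPhone 14"), "label": False},
]
always_match = lambda ex: True  # stand-in zero-shot predictor
hard_examples = select_error_examples(pool, always_match)
print(len(hard_examples))  # the predictor errs only on the non-match
```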

-----

Results 📊:

• Llama 3.1 8B: 17.31-point average F1 gain over the zero-shot baseline

• GPT-4o Mini: 11.72-point average F1 gain

• Structured explanations: 0.93-4.94-point F1 improvement for 3 of 4 models

• Example generation + filtering: reaches 97% of a dedicated model's performance in in-domain generalization
