@nicolay-r on Hugging Face: "📢 Releasing the Chain-of-Thought (CoT)-tuned 🔥 FlanT5-xl (3B) for Target…"

Post

1648

📢 Releasing the Chain-of-Thought (CoT)-tuned 🔥 FlanT5-xl (3B) for Target Sentiment Analysis (TSA) on english texts.
💡 The main reason for adopting this model or smaller version (large and base) are as follows:
✅ 1. Reasoning in sentiment-analysis in zero-shot-learning mode significantly underperforms the fine-tuned FlanT5.
✅ 2. This model showcases top 1 🏆 on the RuSentNE-2023 competitions: https://codalab.lisn.upsaclay.fr/competitions/9538
✅ 3. Easy colab for frameworkless lauch and experiments 🧪 https://colab.research.google.com/github/nicolay-r/Reasoning-for-Sentiment-Analysis-Framework/blob/main/FlanT5_Finetuned_Model_Usage.ipynb

You may find more on the model card, while the fine-tuning statistics per each model size is shown in attachment.

Model: nicolay-r/flan-t5-tsa-thor-xl
Benchmark: https://github.com/nicolay-r/RuSentNE-LLM-Benchmark
Dataset: https://github.com/dialogue-evaluation/RuSentNE-evaluation
Related paper: Large Language Models in Targeted Sentiment Analysis (2404.12342)
Collection: https://huggingface.co./collections/nicolay-r/sentiment-analysis-665ba391e0eba729021ea101