Training Details
Training data
Dolly is trained on ~15k instruction/response fine tuning records databricks-dolly-15k generated by Databricks employees in capability domains from the InstructGPT paper, including brainstorming, classification, closed QA, generation, information extraction, open QA and summarization.
Training Observations
The evaluation results indicate that dolly-v2-12b does not achieve state-of-the-art performance and, in certain evaluation benchmarks, it even falls short of dolly-v1-6b. The researchers hypothesize that this discrepancy can be attributed to the composition and size of the fine-tuning datasets used. However, a comprehensive understanding of the factors contributing to these variations necessitates additional investigation and analysis. Further studies are required to provide a more conclusive explanation for these observed differences.