I'm following this guide which explains how to apply adapters to a model for a binary classification task, and I want to adapt it to a machine translation task.
In a typical Shapley value estimation for a numerical regression task, there is a clear way in which the marginal contribution of an input feature i to the fina