> Transformer-based language models — which include BERT, RoBERTa, and IBM’s Slate and Granite family of models
Why would they not mention the most popular transformer based language models?
IBM's business model is to be worse but sell to lots of clients because the clients don't know any better.
> Transformer-based language models — which include BERT, RoBERTa, and IBM’s Slate and Granite family of models
Why would they not mention the most popular transformer based language models?