A large-scale audit of dataset licensing and attribution in AI Nature Machine Intelligence