This study finds that machine learning models trained on pooled electronic health record (EHR) data from multiple sites can generalize better to unseen data, while also revealing that data sharing partnerships may not always enhance performance, and certain baseline models continue to be competitive except in few-shot learning scenarios.