Robust Offline Policy Learning with Observational Data from Multiple Sources