MERLE : Multimodal Elective Representation Learning of Evolution of birds