Towards Practical Offline Reinforcement Learning: Sample Efficient Policy Selection And Evaluation