EXPERIMENT AND RESULT Experiment Settings Dataset: Music artist recommendation: LastFM , LastFM *. Business recommendation: Yelp, Yelp*. Baselines: Max Entropy [1], Abs Greedy[2], CRM [3] , EAR [4], SCPR [5], UNICORN[6], and MCIPL[7]. [1] Lei, Wenqiang , et al. "Estimation-action-reflection: Towards deep interaction between conversational and recommender systems." Proceedings of the 13th International Conference on Web Search and Data Mining. 2020. [2] Christakopoulou , K., Radlinski , F., & Hofmann, K. (2016, August). Towards conversational recommender systems. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (pp. 815-824). [3] Sun, Y., & Zhang, Y. (2018, June). Conversational recommender system. In The 41st international acm sigir conference on research & development in information retrieval (pp. 235-244). [4] Lei, Wenqiang , et al. "Estimation-action-reflection: Towards deep interaction between conversational and recommender systems." Proceedings of the 13th International Conference on Web Search and Data Mining. 2020. [5] Lei, Wenqiang , et al. "Interactive path reasoning on graph for conversational recommendation." Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining. 2020. [6] Deng, Yang, et al. "Unified conversational recommendation policy learning via graph-based reinforcement learning." Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2021. [7] Zhang, Yiming , et al. "Multiple choice questions based multi-interest policy learning for conversational recommendation." Proceedings of the ACM Web Conference 2022. 2022. Measurement : Success rate ( SR@t ): cumulative ratio of successful recommendations by the turn t Average turns (AT): average number of turns for all sessions. hDCG @(T, K): ranking performance of recommendations.