This paper proposes a reliable multicast routing protocol based on Q-learning for wireless ad-hoc networks. The protocol has two goals: 1) to enhance the reliability of data delivery and 2) to reduce the overhead caused by multicast routing. To achieve these goals, it uses link reliability as the routing metric and selects the most reliable path for data transmission according to its Q-value. It also continuously updates the Q-values of active paths and proactively switches to another path if the current path becomes less reliable. The protocol was evaluated through simulations in Network Simulator 3 (NS-3) and compared with the Multicast Ad-hoc On-demand Distance Vector (MAODV) protocol. The simulation results show that the proposed protocol achieves a higher Packet Delivery Ratio (PDR) than MAODV, which indicates more reliable delivery, and that it also reduces the control overhead caused by multicast routing.
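
The path-selection idea described above can be illustrated with a minimal sketch. The abstract does not state the update rule, so the code assumes the standard Q-learning update with the measured link reliability (a value in [0, 1]) as the reward. The names `PathTable`, `ALPHA`, `GAMMA`, and `SWITCH_MARGIN`, and their values, are hypothetical and are not taken from the protocol itself.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional

# All constants below are illustrative assumptions, not values from the paper.
ALPHA = 0.5           # learning rate
GAMMA = 0.8           # discount factor for the downstream estimate
SWITCH_MARGIN = 0.05  # hysteresis so small fluctuations do not cause a switch


@dataclass
class PathTable:
    """Per-node table of Q-values, keyed by a hypothetical path identifier."""
    q: Dict[str, float] = field(default_factory=dict)
    active: Optional[str] = None

    def update(self, path: str, link_reliability: float,
               downstream_best_q: float = 0.0) -> None:
        # Standard Q-learning update, with link reliability as the reward:
        # Q <- Q + alpha * (reward + gamma * max_downstream_Q - Q)
        old = self.q.get(path, 0.0)
        target = link_reliability + GAMMA * downstream_best_q
        self.q[path] = old + ALPHA * (target - old)

    def best_path(self) -> Optional[str]:
        # The most reliable path is the one with the highest Q-value.
        if not self.q:
            return None
        return max(self.q, key=self.q.get)

    def maybe_switch(self) -> Optional[str]:
        # Proactively switch when another path is better by a clear margin.
        best = self.best_path()
        if best is None:
            return None
        if self.active is None or self.q[best] > self.q[self.active] + SWITCH_MARGIN:
            self.active = best
        return self.active
```

In this sketch a node would call `update` whenever it obtains a fresh reliability estimate for a path and then call `maybe_switch` to decide whether to keep the current path. The hysteresis margin is one possible way to avoid oscillating between paths of similar quality; the protocol's actual switching criterion is not specified in the abstract.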