MULTI-AGENT REINFORCEMENT LEARNING FOR NETWORK PROTOCOL SYNTHESIS