Resource allocation in wireless control systems via deep policy gradient