Deakstadieđáhus: Resource allocation in wireless control systems via deep policy gradient