Skip to content

Instantly share code, notes, and snippets.

@ychoi-kr
Created June 1, 2021 08:47
Show Gist options
  • Save ychoi-kr/b44fdba4ef7accac36868ea1c7416249 to your computer and use it in GitHub Desktop.
Save ychoi-kr/b44fdba4ef7accac36868ea1c7416249 to your computer and use it in GitHub Desktop.
강화학습/심층강화학습 특강 편집
# 최고라고 생각하는 슬롯머신 표시하기
nSelected = nPosReward + nNegReward
for i in range(d):
print('Machine number ' + str(i+1) + ' was selected ' + str(nSelected[i]) + ' times')
print('Conclusion: Best machine is machine number ' + str(np.argmax(nSelected) + 1))
# 최고라고 생각하는 슬롯머신 표시하기
nSelected = nPosReward + nNegReward
for i in range(d):
print('Machine number ' + str(i+1) + ' was selected ' + str(nSelected[i]) +
' times')
print('Conclusion: Best machine is machine number ' + str(np.argmax(nSelected) + 1))
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment