Skip to content

Instantly share code, notes, and snippets.

@onuryartasi
Last active June 30, 2023 13:36
Show Gist options
  • Save onuryartasi/7b861ff3cff77bcf68846db3bec0b2a6 to your computer and use it in GitHub Desktop.
Save onuryartasi/7b861ff3cff77bcf68846db3bec0b2a6 to your computer and use it in GitHub Desktop.
Google Colab Using GPU with Tensorflow version 1.0.0
Display the source blob
Display the rendered blob
Raw
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@MohammadHarisZia
Copy link

@adnvenkatesh can you tell me were the responses any good and also that i have an xps 15 9570 so it has thermal limitations, can you send me your ckpt files to just test it?

@adnvenkatesh
Copy link

As i have mentioned in the above discussion the error did not converge at all. The responses were very bad..just repitions of random strings.I have read in various sources about what might be the problem. I saw in some sources that adams optimizer has convergence problems. So i have implemented gradient descent optimizer and made some changes and got this to work

@MohammadHarisZia
Copy link

Can you kindly share your code ? I just need to understand the issues. Anyway, million thanks. Means a lot tbh for the help.

@adnvenkatesh
Copy link

adnvenkatesh commented Jan 22, 2022

On the github somewhere if i remember correctly there are some very good implementations of gradient descent optimizer in this context in addition with the bidirectional encoder layers. Try using them for understanding. They are far more clear. My code was too messy for you to understand it.😅

@Sudar88
Copy link

Sudar88 commented Feb 11, 2022

Hey all,
Has anyone, in the recent period according to the given instructions, managed to run training on Colab of this model https://www.udemy.com/course/chatbot/ ?

@MohammadHarisZia
Copy link

Hi there @Sudar88, Unfortunately I tried alot but could not make it work out.

@Sudar88
Copy link

Sudar88 commented Feb 11, 2022

@MohammadHarisZia thanks for the answer. After many attempts I was not able to run training this model according to the given instructions.
If anyone has found a solution, I hope they will respond.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment