Kevin Prybol kprybol

## tokenizations_post.md

      
              1 file
            
          
              2 forks
            
          
              0 comments
            
          
              64 stars
            
          
                tamuhey
                / tokenizations_post.md
            
            
              Last active
              June 26, 2024 01:00
            
              
                How to calculate the alignment between BERT and spaCy tokens effectively and robustly
              
          
    How to calculate the alignment between BERT and spaCy tokens effectively and robustly


site: https://tamuhey.github.io/tokenizations/
Natural Language Processing (NLP) has made great progress in recent years because of neural networks, which allows us to solve various tasks with end-to-end architecture. However, many NLP systems still require language-specific pre- and post-processing, especially in tokenizations. In this article, I describe an algorithm that simplifies calculating correspondence between tokens (e.g. BERT vs. spaCy), one such process. And I introduce Python and Rust libraries that implement this algorithm.
Here are the library and the demo site links:

repo: https://github.com/tamuhey/tokenizations


## slack_notifier.py
import os
import requests
import json

SLACK_WEBHOOK= os.environ.get("SLACK_WEBHOOK")


def send_message(messages, channel="abhishek", username="beast"):
    """
    :param messages: list of texts

## entropy_calculation_in_python.py
import numpy as np
from scipy.stats import entropy
from math import log, e
import pandas as pd

import timeit

def entropy1(labels, base=None):
  value,counts = np.unique(labels, return_counts=True)
  return entropy(counts, base=base)

## TemporalMaxPooling.py
from keras import backend as K
from keras.engine import InputSpec
from keras.engine.topology import Layer
import numpy as np


class TemporalMaxPooling(Layer):
    """
    This pooling layer accepts the temporal sequence output by a recurrent layer
    and performs temporal pooling, looking at only the non-masked portion of the sequence.

## Plotly-Julia-Set.ipynb

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              1 star
            
          
                empet
                / Plotly-Julia-Set.ipynb
            
            
              Last active
              July 4, 2016 14:04
            
              
                Escape time algorithm to get Plotly plot of a Julia set
              
          
        Loading

      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.
      
    
## TensorFlow.jl - Real DFT.ipynb

      
              1 file
            
          
              1 fork
            
          
              0 comments
            
          
              1 star
            
          
                staticfloat
                / TensorFlow.jl - Real DFT.ipynb
            
            
              Created
              May 20, 2016 05:20
            
          
        Loading

      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.
      
    
## IntroToJulia.ipynb

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              1 star
            
          
                dmrd
                / IntroToJulia.ipynb
            
            
              Created
              April 29, 2016 01:14
            
              
                Notebook walking through a few Julia examples
              
          
        Loading

      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.
      
    
## powerpairing.ipynb

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              1 star
            
          
                czlee
                / powerpairing.ipynb
            
            
              Last active
              July 4, 2016 13:57
            
          
        Loading

      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.
      
    
## Numpy4AnalysticsForward.ipynb

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              1 star
            
          
                cbcunc
                / Numpy4AnalysticsForward.ipynb
            
            
              Last active
              July 4, 2016 13:56
            
          
        Loading

      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.
      
    
## julia_menu_problem.ipynb

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              1 star
            
          
                yoon
                / julia_menu_problem.ipynb
            
            
              Last active
              July 4, 2016 14:02
            
          
        Loading

      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.
	import os
	import requests
	import json

	SLACK_WEBHOOK= os.environ.get("SLACK_WEBHOOK")


	def send_message(messages, channel="abhishek", username="beast"):
	"""
	:param messages: list of texts
	import numpy as np
	from scipy.stats import entropy
	from math import log, e
	import pandas as pd

	import timeit

	def entropy1(labels, base=None):
	value,counts = np.unique(labels, return_counts=True)
	return entropy(counts, base=base)
	from keras import backend as K
	from keras.engine import InputSpec
	from keras.engine.topology import Layer
	import numpy as np


	class TemporalMaxPooling(Layer):
	"""
	This pooling layer accepts the temporal sequence output by a recurrent layer
	and performs temporal pooling, looking at only the non-masked portion of the sequence.