@amita-shukla
Created March 24, 2020 20:44
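from nltk import word_tokenize

# sample_text5 is defined elsewhere (not shown in this gist); word_tokenize also requires NLTK's Punkt tokenizer data to be downloaded.
word_tokenize(sample_text5)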

['Farewell', 'Blast', 'For', 'a', 'Hotel', 'In', 'Pittsburgh', 'A', 'controlled', 'implosion', 'top', 'begins', 'to', 'crumble', 'the', '16-story', 'Carlton', 'House', 'Hotel', 'in', 'downtown', 'Pittsburgh', 'Saturday', 'morning', '.', 'The', '28-vear-old', 'building', 'is', 'being', 'razed', 'to', 'make', 'way', 'for', 'Renaissance', 'II', ',', 'the', 'second', 'phase', 'of', 'a', 'major', 'redevelopment', 'of', 'the', 'city', '.', 'Thick', 'smoke', 'billows', 'from', 'the', 'building', 'center', 'as', 'more', 'than', '1,000', 'explosive', 'charges', 'do', 'their', 'work', '.', 'Seven', 'seconds', 'after', 'it', 'began', ',', 'the', 'demolition', 'is', 'completed', 'bottom', '.', 'Construction', 'of', 'a', '52-story', 'office', 'building', 'has', 'been', 'proposed', 'in', 'place', 'of', 'the', 'hotel', ',', 'where', 'former', 'Soviet', 'Premier', 'Khrushchev', 'stayed', 'while', 'visiting', 'the', 'city', 'during', 'his', 'tour', 'of', 'the', 'United', 'States', 'in', '1959.ROANOKE', 'UPI', 'Industry', 'analysts', 'believe', 'recent', 'labor', 'unrest', 'and', 'other', 'problems', 'in', 'Poland', ',', 'South', 'Africa', 'and', 'Australia', 'may', 'lead', 'foreign', 'coal', 'buyers', 'to', 'depend', 'more', 'heavily', 'on', 'the', 'United', 'States', 'than', 'in', 'recent', 'years.Poland', 'is', 'labor', 'changes', ',', 'racial', 'unrest', 'in', 'South', 'Africa', 'and', 'a', 'major', 'miners', "'", 'strike', 'in', 'Australia', 'the', 'world', 'is', 'three', 'largest', 'coal', 'exporters', 'behind', 'the', 'United', 'States', 'are', 'spreading', 'the', 'he-', "'", 'lief', 'that', 'the', 'United', 'States', 'may', 'be', 't', 'he', 'most', 'stable', 'source', 'of', 'coal', ',', 'analysts', "'", 'said', '.', 'The', 'United', 'States', "'", 'image', 'as', 'a', 'coal', 'supplier', 'has', 'made', 'a', 'big', 'comeback', '.', 'Our', 'problems', 'are', 'beginning', 'to', 'pale', 'in', 'com-parispn', 'with', 'other', 'coal', 'exporters', ',', 'said', 'Jack', 'Kawa', ',', 
'a', 'coal', 'analyst', 'with', '.', 'Wheat', ',', 'First', 'Securities', '.', 'The', 'outlook', 'for', 'I', 'he', 'U.S.', 'export', 'is', 'getting', 'stronger', 'and', 'stronger', 'as', 'the', 'world', 'turns', 'to', 'coal', '.', 'Customers', 'are', 'turning', 'towards', 'American', 'coal', ',', 'ho', 'said', '.']
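
The output above contains tokens such as '1959.ROANOKE' and 'years.Poland', where a sentence-ending period has no space after it and so stays fused with the next word. A minimal sketch of one possible pre-processing step, assuming you want those split before tokenizing: the helper name `split_fused_sentences` and the regex are illustrative assumptions, not part of NLTK, and the rule is a heuristic that may misfire on other inputs.

```python
import re

# Heuristic (an assumption, not an NLTK feature): a period that directly
# follows a lowercase letter or digit and directly precedes an uppercase
# letter is treated as a sentence boundary with a missing space.
_FUSED_PERIOD = re.compile(r"(?<=[a-z0-9])\.(?=[A-Z])")


def split_fused_sentences(text):
    """Insert a space after periods that are fused to the next sentence."""
    return _FUSED_PERIOD.sub(". ", text)


# Abbreviations like "U.S." are left alone because the character before
# each period is uppercase, so the lookbehind does not match.
cleaned = split_fused_sentences("States in 1959.ROANOKE UPI and the U.S. export")
```

The cleaned string could then be passed to word_tokenize in place of the raw text.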
