Navigation Menu

Skip to content

Instantly share code, notes, and snippets.

@conholdate-gists
Last active June 29, 2021 07:47
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save conholdate-gists/f1699fc0dd9d47653ace12440f857337 to your computer and use it in GitHub Desktop.
Save conholdate-gists/f1699fc0dd9d47653ace12440f857337 to your computer and use it in GitHub Desktop.
Classify PDF Documents using C#
Classify PDF Documents using C#
1. C# API for Document Classification
2. Classify PDF Documents with IAB-2 Taxonomy using C#
3. Classify PDF with Documents Taxonomy using C#
4. PDF Document Classification from Document Stream using C#
5.Classify Password Protected PDF using C#
// create classifier
var classifier = new Classifier();
// classify password-protected document
var response = classifier.Classify("password-protected.pdf", @"C:\Files\", password: "password");
// show classification information
Console.WriteLine(response.BestClassName, response.BestClassProbability);
// create classifier
var classifier = new Classifier();
// classify document with Document taxonomy
var response = classifier.Classify("sample.pdf", @"C:\Files\", 4, Taxonomy.Documents, PrecisionRecallBalance.Precision);
// show classification information
foreach (var r in response.BestResults)
{
Console.WriteLine("ClassName: " + r.Name);
Console.WriteLine("ClassProbability: " + r.Probability);
Console.WriteLine("--------------------------------");
}
// create classifier
var classifier = new Classifier();
// classify document with IAB-2
var response = classifier.Classify("sample.pdf", @"C:\Files\", 3, Taxonomy.Iab2);
// show classification information
foreach (var r in response.BestResults)
{
Console.WriteLine("ClassName: " + r.Name);
Console.WriteLine("ClassProbability: " + r.Probability);
Console.WriteLine("--------------------------------");
}
using (var fs = File.OpenRead(Path.Combine(@"C:\Files\", "sample.pdf")))
{
// create classifier
var classifier = new Classifier();
// classify document
var response = classifier.Classify(fs, "sample.pdf", 2, Taxonomy.Documents);
// show classification information
Console.WriteLine($"{"sample.pdf"}: {response.BestClassName}, {response.BestClassProbability}");
}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment