I am using the tessnet2 OCR library to read a PDF so I can extract some words/information.
When tessnet2 reads the file, it compiles the results into a list of objects of this class:
    public class Word
    {
        public int Blanks;
        public int Bottom;
        public List<Character> CharList;
        public double Confidence;
        public int FontIndex;
        public int Formating;
        public int Left;
        public int LineIndex;
        public int PointSize;
        public int Right;
        public object Tag;
        public string Text;
        public int Top;
        public Word();
        public override string ToString();
    }
The member I care about is string Text.
OK, so what I need help with is using LINQ to search through the result list and find everything between these two words:
    string word1 = "Number;";
    string word2 = "Vendor:";
This is what I have so far, which only finds the two words themselves:
    List<tessnet2.Word> specificWord1 = result.FindAll(x => x.Text == "Number;");
    List<tessnet2.Word> specificWord2 = result.FindAll(x => x.Text == "Vendor:");
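One possible way to get the words in between, as a rough sketch only: chain LINQ's SkipWhile, Skip and TakeWhile over the result list. This assumes the list is in reading order and that the OCR text matches the two marker strings exactly. The Word class below is a cut-down stand-in so the example compiles without tessnet2, and the WordRange.Between helper and the sample data are made up for illustration; neither is part of the library.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Stand-in for tessnet2.Word so the sketch compiles without the library;
// only the Text member is used here.
public class Word
{
    public string Text;
}

public static class WordRange
{
    // Hypothetical helper (not part of tessnet2): returns the words strictly
    // between the first occurrence of startMarker and the next occurrence of
    // endMarker. Returns an empty list if startMarker is absent; if endMarker
    // never appears, everything after startMarker is returned.
    public static List<Word> Between(IEnumerable<Word> words, string startMarker, string endMarker)
    {
        return words
            .SkipWhile(w => w.Text != startMarker) // drop everything before the start marker
            .Skip(1)                               // drop the start marker itself
            .TakeWhile(w => w.Text != endMarker)   // keep words until the end marker
            .ToList();
    }
}

public static class Program
{
    public static void Main()
    {
        // Made-up sample data standing in for the OCR result list.
        var result = new List<Word>
        {
            new Word { Text = "Invoice" },
            new Word { Text = "Number;" },
            new Word { Text = "12345" },
            new Word { Text = "A" },
            new Word { Text = "Vendor:" },
            new Word { Text = "Acme" },
        };

        List<Word> between = WordRange.Between(result, "Number;", "Vendor:");
        Console.WriteLine(string.Join(" ", between.Select(w => w.Text))); // prints "12345 A"
    }
}
```

With the real library the same chain should work directly on the List<tessnet2.Word>, since only Text is read. OCR output often misreads punctuation, so a looser comparison such as StartsWith("Number") may turn out to be more reliable than exact equality.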