官方文档
https://www.e-iceblue.com/Tutorials/Spire.PDF/Program-Guide/Text/Find-and-replace-text-on-PDF-document-in-C.html
C# 在 PDF 中查找替换文本
原文件如下图,替换第一行的新编码,把41230441044替换为41230441000
替换代码如下:
PdfDocument doc = new PdfDocument();//加载PDF文档doc.LoadFromFile("Input.pdf"); PdfTextReplaceOptions textReplaceOptions = new PdfTextReplaceOptions();textReplaceOptions.ReplaceType = PdfTextReplaceOptions.ReplaceActionType.IgnoreCase;textReplaceOptions.ReplaceType = PdfTextReplaceOptions.ReplaceActionType.WholeWord;textReplaceOptions.ReplaceType = PdfTextReplaceOptions.ReplaceActionType.AutofitWidth;PdfPageBase page = doc.Pages[0];PdfTextReplacer textReplacer = new PdfTextReplacer(page);textReplacer.Options = textReplaceOptions;textReplacer.ReplaceAllText("41230441044", "41230441000");doc.SaveToFile("ReplaceTextInPage.pdf");doc.Dispose();
替换后格式乱了,如下图
解决方法如下,作用PdfTextFinder
参考文档https://www.e-iceblue.com/Tutorials/Spire.PDF/Spire.PDF-Program-Guide/Text/Find-and-Highlight-Selected-Text-in-PDF-in-C-VB.NET.html
C#/VB.NET 在 PDF 文档中查找并突出显示文本
PdfDocument doc = new PdfDocument();//加载PDF文档
doc.LoadFromFile("Input.pdf");
PdfPageBase page = doc.Pages[0];
PdfTextFinder finder = new PdfTextFinder(page);
var finds = finder.Find("41230441044");
foreach (var textFragment in finds)
{textFragment.ApplyRecoverString("41230441000", Color.White, true);
}
doc.SaveToFile("ReplaceTextInPage.pdf");
doc.Dispose();