Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 8 years ago.
I am looking for the best way to convert PDF to HTML. Note that the PDFs contain different layouts, SmartArt, and images. Can you please suggest something? I would prefer an API that I can use from a C# program, so that I can convert a number of files programmatically. I would also prefer to convert the images and embed them as base64.
Some time ago (2013), I developed a PDF-to-EPUB converter (EPUB is a variation of HTML).
I also wanted to develop in C# and looked at what was available, but the best libraries are in C/C++. You probably know that PDF is a very tricky format, and even the best converters fail on some documents, so you really have to stick with the best options.
From C#, you can easily call C or C++ functions, so using a library in those languages is not much of a problem.
Poppler (http://poppler.freedesktop.org/) is the PDF library that I chose. It is based on the Xpdf PDF viewer. It is reliable, but you will have to postprocess the HTML code anyway. The package contains command-line utilities, including pdftohtml, a PDF to HTML converter. Source files are also available.
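As a rough sketch of that approach from C#: the code below shells out to pdftohtml and then postprocesses the result so that images are embedded as base64 data URIs, as the question asks. It assumes pdftohtml is installed and on the PATH, and that the generated HTML references images through plain src="..." attributes. The class and method names are made up for illustration, and the output file name (outputBase + ".html" with -noframes) should be checked against your Poppler version.

```csharp
using System;
using System.Diagnostics;
using System.IO;
using System.Text.RegularExpressions;

public static class PdfToHtmlSketch
{
    // Runs "pdftohtml -noframes <pdf> <outputBase>" and returns the path
    // where the HTML file is expected to be written (an assumption).
    public static string RunPdfToHtml(string pdfPath, string outputBase)
    {
        var psi = new ProcessStartInfo
        {
            FileName = "pdftohtml", // assumed to be on PATH
            Arguments = "-noframes \"" + pdfPath + "\" \"" + outputBase + "\"",
            UseShellExecute = false,
            CreateNoWindow = true
        };
        using (Process process = Process.Start(psi))
        {
            process.WaitForExit();
            if (process.ExitCode != 0)
                throw new InvalidOperationException("pdftohtml exited with code " + process.ExitCode);
        }
        return outputBase + ".html";
    }

    // Postprocessing step: replaces src="image.png" / src="image.jpg" with a
    // base64 data URI. References to files that do not exist are left untouched.
    public static string InlineImages(string html, string baseDir)
    {
        return Regex.Replace(html, "src=\"([^\"]+\\.(png|jpe?g))\"", match =>
        {
            string imagePath = Path.Combine(baseDir, match.Groups[1].Value);
            if (!File.Exists(imagePath))
                return match.Value;

            bool isPng = string.Equals(match.Groups[2].Value, "png", StringComparison.OrdinalIgnoreCase);
            string mime = isPng ? "image/png" : "image/jpeg";
            string data = Convert.ToBase64String(File.ReadAllBytes(imagePath));
            return "src=\"data:" + mime + ";base64," + data + "\"";
        }, RegexOptions.IgnoreCase);
    }
}
```

A typical use would be to call RunPdfToHtml, read the resulting file with File.ReadAllText, pass it through InlineImages with the output directory as baseDir, and write the result back.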
Another very good option is PDFLib: http://www.pdflib.com/ It is a commercial product.
Closed 4 years ago.
In a C# application, I must generate a PDF document from a specific template which includes fields and images.
So I thought of creating a ".tex" file as my template, modifying it from the application, and finally using an API to get a PDF file from the ".tex" file.
But I have also heard of APIs that generate a PDF file directly without using LaTeX. However, I think it will not be easy to reproduce the template with such an API.
What is the easiest way, and the best API?
I would recommend creating an HTML template and then converting it to PDF via https://code.google.com/p/wkhtmltopdf/ .
You can use the Razor templating engine and get good Visual Studio support for creating your templates.
It's also easier to debug and fine-tune the result in a browser.
You can also find people to write HTML templates more easily.
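A minimal sketch of that pipeline, under a few assumptions: wkhtmltopdf is installed and on the PATH, and instead of Razor the template is filled with a plain {{name}} placeholder substitution (a hypothetical convention used here only to keep the example dependency-free). The class and method names are illustrative.

```csharp
using System;
using System.Collections.Generic;
using System.Diagnostics;
using System.IO;
using System.Net;

public static class HtmlTemplateToPdfSketch
{
    // Replaces {{key}} placeholders with HTML-encoded values.
    // This stands in for a real templating engine such as Razor.
    public static string FillTemplate(string template, IDictionary<string, string> values)
    {
        foreach (KeyValuePair<string, string> pair in values)
        {
            template = template.Replace("{{" + pair.Key + "}}", WebUtility.HtmlEncode(pair.Value));
        }
        return template;
    }

    // Writes the HTML to a temporary file and runs
    // "wkhtmltopdf <input.html> <output.pdf>" on it.
    public static void RenderPdf(string html, string pdfPath)
    {
        string htmlPath = Path.Combine(Path.GetTempPath(), Guid.NewGuid().ToString("N") + ".html");
        File.WriteAllText(htmlPath, html);
        try
        {
            var psi = new ProcessStartInfo
            {
                FileName = "wkhtmltopdf", // assumed to be on PATH
                Arguments = "\"" + htmlPath + "\" \"" + pdfPath + "\"",
                UseShellExecute = false,
                CreateNoWindow = true
            };
            using (Process process = Process.Start(psi))
            {
                process.WaitForExit();
                if (process.ExitCode != 0)
                    throw new InvalidOperationException("wkhtmltopdf exited with code " + process.ExitCode);
            }
        }
        finally
        {
            File.Delete(htmlPath); // clean up the temporary HTML file
        }
    }
}
```

Because the intermediate artifact is ordinary HTML, the output of FillTemplate can be opened in a browser to debug the layout before it is ever converted.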
Closed 4 years ago.
I am looking for an easy way to convert .doc and .docx files to PDF using C# and ASP.NET. I previously used iTextSharp for this, but that requires creating the document from scratch. I want a way to convert the Word files to PDF as they are: if a file includes images, tables, etc., they must appear in the PDF unchanged. Is there any free library or code? Thanks in advance.
1) You can check this PDFConverter; it might be helpful to you. It is a COM component, callable from .NET.
2) Or you can check this open source library, PDFSharp.
3) And the third option is the Aspose library.
If you are able to buy a component, you can use Aspose Words, which is well suited to converting a Word document to PDF as is. There is no need to install MS Office if this component is used.
Closed 8 years ago.
I am searching for a tool that can convert files of any of these types (doc, docx, xls, xlsx, jpeg, jpg, txt, pdf, rtf) into a PDF file from ASP.NET code, like the Aspose tool does. Aspose is too costly for me; I want the same functionality, but less expensive.
Please suggest any tool like this.
Thanks
Maybe iTextSharp would do the trick for you?
Here is a link for you
I am sure that if you want to put the time into it, the Office COM interop objects can do it. LibreOffice has an API that can do it as well.
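For the LibreOffice route, here is a hedged sketch. Note that it uses LibreOffice's headless command-line converter rather than the programmatic (UNO) API mentioned above, because the command line is far simpler to drive from C#. It assumes the soffice executable is on the PATH; the class and method names are made up for illustration.

```csharp
using System;
using System.Diagnostics;
using System.IO;

public static class OfficeToPdfSketch
{
    // Builds the arguments for:
    //   soffice --headless --convert-to pdf --outdir <dir> <file>
    public static string BuildArguments(string inputPath, string outputDir)
    {
        return "--headless --convert-to pdf --outdir \"" + outputDir + "\" \"" + inputPath + "\"";
    }

    // LibreOffice names the result after the input file, with a .pdf extension.
    public static string ExpectedOutputPath(string inputPath, string outputDir)
    {
        return Path.Combine(outputDir, Path.GetFileNameWithoutExtension(inputPath) + ".pdf");
    }

    // Converts one document (doc, docx, xls, xlsx, rtf, txt, ...) to PDF
    // and returns the path of the generated file.
    public static string ConvertToPdf(string inputPath, string outputDir)
    {
        var psi = new ProcessStartInfo
        {
            FileName = "soffice", // assumed to be on PATH
            Arguments = BuildArguments(inputPath, outputDir),
            UseShellExecute = false,
            CreateNoWindow = true
        };
        using (Process process = Process.Start(psi))
        {
            process.WaitForExit();
        }

        string pdfPath = ExpectedOutputPath(inputPath, outputDir);
        if (!File.Exists(pdfPath))
            throw new InvalidOperationException("Conversion produced no output for " + inputPath);
        return pdfPath;
    }
}
```

On a web server, each conversion starts a LibreOffice process, so it is probably wise to queue conversions rather than run them directly inside a request.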
3-Heights have a component (Document Converter) that does this. For HTML there are several other alternatives (see here: Convert HTML to PDF in .NET).
If you have full control over the web server, you could try to print those documents to a PDF printer or use an installed Acrobat Writer, though I wouldn't recommend this solution. It smells somewhat like a hack.
Edit: 3 Heights Document Converter Service
Closed 8 years ago.
I'm working on a .NET/C# project that needs to generate a contract PDF file (for printing and browsing) based on information retrieved from a database.
The file also includes several pages of fixed content. It seems that Crystal Reports does not deal well with multi-page files. I also did some research online, and someone suggested iTextSharp.
The problem is that the format of the file can be complicated, and iTextSharp is not very efficient for that.
Does anyone have an idea?
PDFsharp is an excellent library for this. They also have MigraDoc, which allows you to write documents to PDF, XPS, and RTF. The API is robust and based on GDI. Multiple pages shouldn't be a problem, and you can even draw tables and so on.
Quick samples are here, but download the project source as well; it has a hoard of good samples.
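To give an idea of what the API looks like, here is a small sketch that writes a two-page document: one page with data and one page of fixed content. It assumes the PDFsharp 1.x package is referenced (newer major versions have renamed some types, such as the font style enum) and that the "Verdana" font is available on the machine. The class name, method name, and sample text are invented for illustration.

```csharp
using PdfSharp.Drawing;
using PdfSharp.Pdf;

public static class ContractPdfSketch
{
    public static void Write(string path, string customerName)
    {
        var document = new PdfDocument();
        document.Info.Title = "Contract";

        var titleFont = new XFont("Verdana", 18, XFontStyle.Bold);
        var bodyFont = new XFont("Verdana", 11, XFontStyle.Regular);

        // Page 1: data that would come from the database.
        PdfPage first = document.AddPage();
        using (XGraphics gfx = XGraphics.FromPdfPage(first))
        {
            gfx.DrawString("Contract", titleFont, XBrushes.Black,
                new XRect(40, 40, 500, 30), XStringFormats.TopLeft);
            gfx.DrawString("Customer: " + customerName, bodyFont, XBrushes.Black,
                new XRect(40, 90, 500, 20), XStringFormats.TopLeft);
        }

        // Page 2: fixed content that is the same for every contract.
        PdfPage second = document.AddPage();
        using (XGraphics gfx = XGraphics.FromPdfPage(second))
        {
            gfx.DrawString("Terms and conditions", titleFont, XBrushes.Black,
                new XRect(40, 40, 500, 30), XStringFormats.TopLeft);
        }

        document.Save(path);
    }
}
```

For longer flowing text and tables, MigraDoc is likely the better fit, since PDFsharp itself only draws at fixed coordinates and does not lay out paragraphs.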
Please take a look at Windward Reports (I'm the CTO at Windward). With Windward you design in Word, Excel, or PowerPoint, so anything that you can lay out in Office, no matter how complex, we can then render with data as a PDF.
Closed 5 years ago.
I need to edit a PDF at runtime in C#/.NET. Is this possible? For example, copying things in the PDF, editing text fields, ...?
Look at iTextSharp. It's a .NET library for manipulating PDFs.
It is definitely possible, but not with what is built into .NET (unless you are a PDF format guru and can roll your own). I've mostly used Aspose for manipulating PDFs, but there are a lot of toolkits out there. Personally, I think Aspose is a little rough around the edges, but not more so than other kits.
Well, there's iTextSharp (http://sourceforge.net/projects/itextsharp/), with which you can write PDF files, but I don't think it has editing (reading PDF) capabilities, or they will be limited.
I've never found a way to edit PDFs, unless you buy some third-party tool ...
Look at this
iText# (iTextSharp) is a port of the iText open source Java library for PDF generation, written entirely in C# for the .NET platform.
Docotic.Pdf library may be used to edit PDF files. You can fill forms, import and export forms data, extract text, append documents and so on. Please take a look at samples for the library.
Disclaimer: I work for Bit Miracle.
It can be done, have a look at iTextSharp.
iTextSharp is good for adding things to a PDF.
These PDF components are also pretty good: www.tallcomponents.com