Channel: help.fivefilters.org :: Full-Text RSS
Viewing all 219 articles

Move Link from enclosure link to link-tag

Is it possible to move the link in the enclosure tag of a podcast feed to the link tag?

Does Full-Text RSS need very much traffic?

After I submitted some feeds, the traffic increased to 1000 GB per month. Is that right? How can I resolve this problem? Thanks

error message

Hi, some of the feeds I use work fine but some don't (even if they are from the same source). Full-Text RSS is showing the following error: "error on line 2 at column 1: Extra content at the end of the document". Can I somehow resolve that problem? Kind regards, Matthias

JSON encoding issue

It looks like I am being returned unicode encoding in my json. For example, \u2019 for apostrophes (\u2019 is converted into ’). When I run debug with rawhtml it looks like the headers are specified as UTF-8. When I run as parsedhtml, I'm getting "Disallowed Key Characters." Any idea what's going on? Thanks!
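For what it's worth, \u2019 is a standard JSON string escape, so any JSON parser should turn it back into the ’ character on decoding. A minimal Python sketch of that round trip follows; the sample JSON string is made up for illustration and is not actual Full-Text RSS output, and it says nothing about the "Disallowed Key Characters" message.

```python
import json

# Made-up JSON text: the apostrophe is escaped as \u2019, as in the post.
raw = '{"title": "It\\u2019s here"}'

# Decoding turns the escape back into the real character.
decoded = json.loads(raw)
print(decoded["title"])

# Re-encoding with ensure_ascii=False keeps the character literal
# instead of escaping it again.
literal = json.dumps(decoded, ensure_ascii=False)
print(literal)
```

So the escaped form and the literal form carry the same text; the difference is only in how the JSON was serialized.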

Feature Request: Support array of URLs on the extract.php endpoint

Let's say I need to retrieve text content from 100 different web pages, instead of submitting 100 different requests to my Full-text RSS installation, I could submit them in batches of 10, which would significantly reduce the amount of work my client app needs to do. We could simply submit a JSONified array of URL/link strings in a POST HTTP request. Is this something you think you would want to support? Or is this something that is already possible, but I may have completely missed? (sorry, if that's the case) Thanks!
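The client-side half of this idea can be sketched in Python as below. This only shows splitting 100 URLs into batches of 10 and serializing each batch as a JSON array; the example.org URLs are placeholders, and the JSON-array POST body is the format proposed in the post, not something extract.php is known to accept.

```python
import json

def batches(urls, size=10):
    """Yield consecutive slices of `urls`, each at most `size` long."""
    for start in range(0, len(urls), size):
        yield urls[start:start + size]

# Placeholder input: 100 hypothetical page URLs.
urls = [f"http://example.org/page/{n}" for n in range(100)]

# One JSON array per batch, as the post proposes sending in a POST body.
payloads = [json.dumps(batch) for batch in batches(urls)]
print(len(payloads))
```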

Site Config Update

Hello, I have been trying to update my custom site config file, but oddly with no success. I even deleted the entire config and still no change. Can this be caused by the config staying in the cache and not changing? If yes, is there any way to flush the cache? Thank you

Turning APC off & caching

Keyvan, I can confirm that disabling APC in config solves the problem of a site config file not loading. Thanks! However, if APC is off and I am processing several feeds, will I be re-loading all the site config files every time I call makefulltextfeed.php? I want to set up some global & site configs (strip_id_or_class: breadcrumbs) but it only works with APC off. Is there some way to keep APC on, but force a config into cache so it stays there? I currently cannot flush the APC cache due to other processes using it right now. But were I to leave APC enabled, and flush the cache, could I then expect the configs to properly get loaded and function until a future flush? Thanks for clarifying.

Overriding global strip

I'm wondering how to declare a div NOT to be stripped for a site, when stripped globally. I have the following in my global config, as I need to strip this title div from nearly ALL feeds I am extracting: strip_id_or_class: post-title But I have one feed where I don't want to strip post-title. I can't seem to figure out how to override the global file. As I understand it, the site specific rules should take precedence. Tried adding this to site specific file, with no luck: title: //h1[@class="post-title"]

Hide [embedded content]

Is it possible to hide the "[embedded content]" message when using extract.php?

Custom output on extract

How can I customize the output of extract.php? (Simple mode)

Best strategy for processing multiple feeds

I submitted a similar question at the end of an older thread (Turning APC off), but wasn't sure if that thread is still actively monitored. My apologies for the duplication. I am using FTR to monitor several client feeds (300+) and am trying to figure out the most efficient way to extract NEW content. I currently check the feeds every few hours for new articles. When a new article is found, I store it in our db and from that point forward I will then only extract articles newer than the last one stored. Most of the feeds do not publish every day, and if they do they rarely publish more than one article a day. Still, because I can't know if, when or how often they publish, I am checking regularly. Also, in order to do the date compare, I'm obligated to pull back whatever is in the feed, checking the article pub date, then stopping once I reach an older date, and moving on to the next feed. Currently, I am looping through my list of hundreds of feeds, and calling makefulltextfeed.php for each feed -- array('format'=>'json','max'=> 100,'summary'=>1,'url'=>$this->feed_url) I am able to do this for about 70 at a time before I get a server error (500), which I am presently trying to debug. I'm wondering if there is a more efficient way to do this. From another thread (Feature Request: Support array of URLs on the extract.php endpoint) I see I can combine URLs in a single request to makefulltextfeed.php. I'm wondering if this strategy supports hundreds of URLs concatenated together, and if this would be more efficient. I could break it into fewer URLs per request, if that would help. Also, wondering if there is some more efficient way in which I can accomplish the date compare, to just check for new content. Your assistance is immensely appreciated. This is the last piece of the puzzle for publishing this service. This tool has been incredibly useful!
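The date compare described above could look roughly like this in Python. It is a sketch under assumptions: each item is a dict with an RFC 822 "pubDate" string that includes a time zone offset, items arrive newest first, and the name new_items and the sample data are hypothetical.

```python
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime

def new_items(items, last_stored):
    """Return items published after `last_stored`, assuming newest-first order."""
    fresh = []
    for item in items:
        published = parsedate_to_datetime(item["pubDate"])
        if published <= last_stored:
            break  # everything after this point is older; stop scanning
        fresh.append(item)
    return fresh

# Hypothetical feed items, newest first.
items = [
    {"title": "c", "pubDate": "Thu, 04 Dec 2014 09:00:00 +0000"},
    {"title": "b", "pubDate": "Wed, 03 Dec 2014 09:00:00 +0000"},
    {"title": "a", "pubDate": "Mon, 01 Dec 2014 09:00:00 +0000"},
]
last_stored = datetime(2014, 12, 2, tzinfo=timezone.utc)
fresh = new_items(items, last_stored)
print([item["title"] for item in fresh])
```

Stopping at the first older item avoids touching the rest of the feed, but it only works if the feed really is sorted by date.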

[unable to retrieve full-text content]

url: http://amanz.my/2014/12/telefon-pintar-berjenama-energizer-akan-muncul-tahun-hadapan/ demo: http://ftr.fivefilters.org/makefulltextfeed.php?url=http://amanz.my/2014/12/telefon-pintar-berjenama-energizer-akan-muncul-tahun-hadapan/&max=1

Normal RSS capture through full-text rss 3.4

Hi, I'm using the self-hosted version of Full-Text RSS 3.4 within a script to fetch a few feeds, put them together into one single feed and create a newspaper out of it. That works brilliantly! But for some feeds, I don't need Full-Text RSS to visit every single item and try to fetch the content, because the feed already contains everything I want. Is there a config option to just parse the feed with the given parameters (e.g. max=5) and not try to get the full text? This would be easier for me, though of course pre-filtering in my script would also be possible. Thank you, Josef

Wrong extraction of iframe

Hi, I'm using Full-Text RSS to extract content of articles, and obviously, those articles contains videos from YouTube, Vimeo... With iFrames. What I've found is that when you extract the content of those articles, the src of the iFrame looks like this: When it should be: Notice that on the first example it misses the "http:". How can I change it?
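A src that starts with "//" is a protocol-relative URL. One way to handle it on the client side, after extraction, is to prepend a scheme. The Python sketch below assumes the extracted HTML is available as a string; add_scheme and the sample iframe are hypothetical, and this is post-processing, not a Full-Text RSS setting.

```python
import re

def add_scheme(html, scheme="http"):
    """Prefix protocol-relative src attributes (src="//...") with a scheme."""
    return re.sub(r'(\bsrc=["\'])//', r'\g<1>' + scheme + '://', html)

# Hypothetical extracted snippet with a protocol-relative iframe src.
snippet = '<iframe src="//www.youtube.com/embed/abc"></iframe>'
print(add_scheme(snippet))
```

Attributes that already carry a scheme are left untouched, because the pattern only matches when the value begins with "//".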

Restrict output to just 1st para + image of article?

Hello. I'm using a self-hosted implementation of Full-RSS. Is there an option to modify the output to consist of just the first paragraph and first image, instead of everything? The 'include excerpt' doesn't fit my downstream needs. My ultimate purpose is to create a Mailchimp RSS - based E-Mail campaign, which reads a RSS feed (in this case, the RSS of a Pinboard tag). Mailchimp inserts text in the tag (failing which, it reads the tag). I would ideally prefer just the first paragraph + image (MC reads the tag for images). Is there a way to limit the Full-RSS feed parsing to just these? I currently get the entire article, which doesn't work for the purpose of the newsletter. Thanks!
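If no built-in option fits, one workaround is to trim the extracted HTML downstream. The Python sketch below pulls the text of the first paragraph and the src of the first image out of an HTML string using only the standard library; the class and function names and the sample HTML are hypothetical, and this is not a Full-Text RSS feature.

```python
from html.parser import HTMLParser

class FirstParaAndImage(HTMLParser):
    """Collect the text of the first <p> and the src of the first <img>."""

    def __init__(self):
        super().__init__()
        self.paragraph = None  # text of the first <p>, once it has closed
        self.image = None      # src of the first <img>
        self._in_p = False
        self._buf = []

    def handle_starttag(self, tag, attrs):
        if tag == "p" and self.paragraph is None and not self._in_p:
            self._in_p = True
        elif tag == "img" and self.image is None:
            self.image = dict(attrs).get("src")

    def handle_endtag(self, tag):
        if tag == "p" and self._in_p:
            self._in_p = False
            self.paragraph = "".join(self._buf).strip()

    def handle_data(self, data):
        if self._in_p:
            self._buf.append(data)

def first_para_and_image(html):
    parser = FirstParaAndImage()
    parser.feed(html)
    parser.close()
    return parser.paragraph, parser.image

# Hypothetical article body.
sample = '<div><img src="a.jpg"><p>First <b>para</b>.</p><p>Second.</p></div>'
print(first_para_and_image(sample))
```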

Incomplete extraction in quote div

Dear Team, I always have a problem when trying to extract from sites that have a quote div: the content inside the "quote div" is still missing. For example, these feeds: http://www.shaanig.com/external.php?type=RSS2&forumids=51 or http://warez-serbia.com/software/rss.xml. I have tried to edit the site patterns, but it never worked. Please give me the right code to get full extraction when the feed has a quote div. Thanks

[unable to retrieve full-text content]

Hi Yuhoo! You tried to create a full-text feed from only one article! ;-) Try this URL in your FiveFilters app: http://amanz.my/feed/ The result: http://ftr.fivefilters.org/makefulltextfeed.php?url=http%3A%2F%2Famanz.my%2Ffeed%2F&max=3

Some Feeds are not showing all items.

Hi! I'm using Full-Text RSS 3.4 (self-hosted) and I like it. But sometimes I'm stuck and I don't know why. Perhaps you have an explanation. I'm trying to set up "my" newspaper. I want to add two feeds: http://www.faz.net/rss/aktuell/politik/ http://www.faz.net/rss/aktuell/feuilleton/ As you can see, it's the same website. I want to limit each to 25 items. When I try that with the first link, it works superbly. When I try it with the second link, I have to limit it to five (!) items. Otherwise it will not work. Any ideas? Thanks! Gabe

Feed Creator: Ideas for a problematic site without links?

Hi! I would like to receive news from a particular website: http://andreas-schule-bestwig.de/01_00_aktuelles.html As you can see, there are no links. All news items are posted on one page. Take a look at the dates: 13.02.2015, 12.02.2015 etc. Is there any chance to generate a feed? Something like "If you find a date which is formatted as H6, create a feed item!". Thanks! Gabe

License question

I would like to use your full-text library in a commercial application. Would I be required to freely distribute my entire source code if I did so? That's the way I'm reading the AGPL documentation, but then you offer a business license, so I'm confused. Any help appreciated.