What sorts of data are we interested in here?

- The different tags and their namespaces.
- The secret md5 hash buried in the HTML.

A tempting strategy for pulling the file URL is to just fetch the src of the embedded <img> tag, but:

- If the booru also supports videos or flash, you'll have to write separate and likely more complicated rules for <video> and <embed> tags.
- If the booru shows 'sample' sizes for large images (as this one does!), pulling the src of the image you see won't get the full-size original for large images.

If you have an account with the site you are parsing and have clicked the appropriate 'Always view original' setting, you may not see these sorts of sample-size banners! I recommend you log out of, or go incognito for, sites you are inspecting for hydrus parsing, or you can easily miss NSFW gates and other logged-out hurdles. The exception is a site that requires a log-in to see content, in which case the hydrus user will have to set up hydrus-side login to actually use the parser.

When trying to pin down the right link, if there are no good alternatives, you often have to write several File URL rules with different precedence, saying 'get the "Click Here to See Full Size" link at 75' and 'get the embed's "src" at 25' and so on, to make sure you cover different situations. As it happens, though, Gelbooru always posts the actual File URL in two places:

- The <meta> tag with property="og:image", which is easy to search for (and they use the same tag for video links as well!).
- The "Original Image" link, which can be found by putting a String Match in the html formula.

For the Original Image link, you can use a String Match on the link text. Gelbooru uses "Original Image" even when they link to webm, which is helpful, but like "og:image", it could be changed to 'video' in future.
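To make the precedence idea concrete, here is a minimal Python sketch that works outside hydrus. It is not how hydrus implements its parser; it only mimics the 'several rules with different precedence' approach using the standard library. The class name `FileURLCandidates`, the function `best_file_url`, the precedence value of 50 for the "Original Image" link, and the example.com URLs are all assumptions made up for illustration.

```python
from html.parser import HTMLParser


class FileURLCandidates(HTMLParser):
    """Collect (precedence, url) candidates from a post page (illustrative only)."""

    def __init__(self):
        super().__init__()
        self.candidates = []
        self._pending_href = None  # href of the <a> currently open, if any
        self._link_text = []       # text seen inside that <a>

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and attrs.get("property") == "og:image" and attrs.get("content"):
            # Assumed highest precedence: the og:image meta tag.
            self.candidates.append((75, attrs["content"]))
        elif tag == "a" and attrs.get("href"):
            self._pending_href = attrs["href"]
            self._link_text = []
        elif tag == "img" and attrs.get("src"):
            # Lowest precedence: the visible image may only be a sample size.
            self.candidates.append((25, attrs["src"]))

    def handle_data(self, data):
        if self._pending_href is not None:
            self._link_text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._pending_href is not None:
            # Exact match on the link text, similar in spirit to a String Match.
            if "".join(self._link_text).strip() == "Original Image":
                self.candidates.append((50, self._pending_href))
            self._pending_href = None


def best_file_url(html):
    """Return the candidate URL with the highest precedence, or None."""
    parser = FileURLCandidates()
    parser.feed(html)
    parser.close()
    if not parser.candidates:
        return None
    return max(parser.candidates, key=lambda c: c[0])[1]


# Hypothetical page fragment; the URLs are placeholders, not real Gelbooru links.
sample = (
    '<meta property="og:image" content="https://example.com/images/full.webm"/>'
    '<img src="https://example.com/samples/sample_full.jpg">'
    '<li><a href="https://example.com/images/full.webm">Original Image</a></li>'
)
print(best_file_url(sample))
```

If the og:image tag were ever removed or renamed, the same function would fall back to the "Original Image" link and then to the image src, which is the reason for keeping several rules rather than one.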