Scrapebox Posting

What platforms can scrapebox post to?

Scrapebox can post to the following platforms (in 1.x this requires the Learning Mode poster to be installed):

4images
CopperMine Photo Gallery
Gallery V2
Pixel Post
Plogger
Advanced Guestbook
AkoBook
ArdGuestbook
Basti2Web Guestbook
BeepWorld Guestbook
Bella Guestbook
Burning Book
Chinese Blog
CMS2Day
CoderWorld Guestbook
DEDE Ims
DRBGuestbook
Easybook
GA Guestbook
Icybook Guestbook
Jambook Guestbook
Jax Guestbook
Plesk Sitebuilder
Aska BBS
ASP Blog
Blog Engine
Joomla Comment
K2 Blog
Serendipity
TextCube Guestbook
Wordpress Blog
e107 Forum
WP Contact Form 7 (Scrapebox 2.0 only)
WP Secure Contact Form (Scrapebox 2.0 only)
Wordpress Trackbacks

Slow Poster Mode (obsolete - 1.x only):
Wordpress
Movable Type
B2Evolution
BlogEngine
Drupal
Expression Engine

 

Note: You can train additional platforms, but it is an intermediate/advanced level task.

If you export failed entries from posting, then repost to the failed ones, why do more succeed the 2nd time around?

There are many reasons for this behavior.
- The site could have been offline or overloaded the first time around
- The connection could have timed out
- The proxy used the first time could have been bad

Any of these could mean that a second run posts some additional comments.  There are too many variables to try to control every time.  The best practice to maximize a run is to lower the number of connections and increase the timeouts.  If you need maximum links from your runs, plan on reposting to the failed entries.

How do I specify the Anchor I want to use for my links?

Whatever you place in the names section is what will be used as your anchor.

How do you lock a specific anchor/name to a specific url?

You can lock a specific anchor to a specific url when posting.  This allows you to post to many urls with different anchors in the same posting run.   This feature is called Link Lock.

You use link lock in the websites.txt file.  It should be formatted like this:

http://www.site.com {anchor1|anchor2|etc..}
http://www.site1.com {anchor1|anchor2|etc..}

You are still required to place something in the names.txt file, as this is the backup file that will be used if something goes wrong.  For instance, if your spin syntax is malformed, scrapebox may not be able to understand what anchor you wanted, so it will use a line from the names file instead.
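
To make the format concrete, here is a rough Python sketch of how a Link Lock line could be split into a url and an anchor, with the names list as the backup.  This is only an illustration and not Scrapebox's actual code: the function name parse_link_lock, the regular expression, and the exact fallback behavior are assumptions.

```python
import random
import re

# Assumed line shape, taken from the example above: a url, whitespace, then {a|b|c}.
LINK_LOCK_LINE = re.compile(r"^(\S+)\s+\{([^{}]*)\}\s*$")


def parse_link_lock(line, fallback_names):
    """Return (url, anchor) for one websites.txt line.

    Falls back to a random entry from fallback_names when the line has no
    well-formed {a|b|c} block, mirroring the names.txt backup described above.
    """
    line = line.strip()
    match = LINK_LOCK_LINE.match(line)
    if match:
        url, block = match.groups()
        anchors = [a.strip() for a in block.split("|") if a.strip()]
        if anchors:
            return url, random.choice(anchors)
    # Malformed or missing anchor block: keep the url, borrow a name instead.
    url = line.split()[0] if line else ""
    return url, random.choice(fallback_names)
```

A line such as "http://www.site.com {anchor1|anchor2}" would yield the url with one of the two anchors, while a line with a missing closing brace would yield the url with a name from the backup list.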

Scrapebox says X number were successful when posting, but the links aren't there

If you have ever commented on a blog manually then you might remember that sometimes you will get a notification from the blog saying that the comment was received.  However, many times you get no notification from the blog.  When scrapebox receives that notification from the blog that the comment was received, it reports it as successful.  When it receives no notification it assumes that it failed.  This does not mean that it did in fact fail, only that scrapebox cannot determine whether it was successful.  Many of the blogs that show as failed in scrapebox actually received your comment.

The blog commenter is just that, a commenter.  It places the comments.  Most comments will then go to moderation and be approved or denied.  Some comments will be placed on auto approve blogs, where the comment is approved automatically.  So a successful post just means the comment was submitted.  It does not mean that the link is live on the site; it is up to the admin to approve or deny it, unless the blog is auto approve, in which case the comment goes live immediately.  If you check links right after you comment, the live ones are going to be from auto approve blogs.

What tokens can I use in the comments and messages file?

2.x Only:

  • %currentdate% - Inserts today's date.
  • %currenttime% - Inserts the current time.
  • %currentos% - Inserts the Operating System that is being used.
  • %newline% - Inserts a new line.  This is handy if you want to have multiple paragraphs or just format your text as you wish.
  • %domain% - Inserts the domain that the comment is being placed on.  For example, if you are commenting or sending a message on domain1.com, you can write "I love %domain% !" and it will show up as "I love domain1.com !"
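
If it helps to picture what the tokens do, the following Python sketch mimics the substitution.  It is an approximation only: the function name fill_tokens is made up, and the date and time formats are assumptions, since the formats Scrapebox actually uses are not stated here.

```python
import platform
from datetime import datetime
from urllib.parse import urlparse


def fill_tokens(template, target_url, now=None):
    """Replace the 2.x tokens listed above in one comment or message."""
    now = now or datetime.now()
    values = {
        "%currentdate%": now.strftime("%Y-%m-%d"),  # assumed date format
        "%currenttime%": now.strftime("%H:%M"),     # assumed time format
        "%currentos%": platform.system(),
        "%newline%": "\n",
        "%domain%": urlparse(target_url).netloc,
    }
    for token, value in values.items():
        template = template.replace(token, value)
    return template
```

With this sketch, fill_tokens("I love %domain% !", "http://domain1.com/post/1") gives "I love domain1.com !", matching the example in the list.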

 

1.x ONLY:

The below tokens specifically pertain to regular Fast and Slow poster in the main Scrapebox window.  These tokens do not work with the learning mode addon.  It has its own specific tokens.  See the FAQ in the addons section here.

  • %NAME% will be replaced with the user's name from Names.txt
  • %EMAIL% will be replaced with the user's email from Emails.txt, hyperlinked (except BE, just the email)
  • %WEBSITE% will be replaced with the user's website from Websites.txt, hyperlinked (except BE, just the url)
  • %BLOGURL% will be replaced with the url of the blog you are commenting on, hyperlinked (except BE, just the url)
  • %BLOGTITLE% will be replaced with the page title of the blog you are commenting on

Note: the %BLOGURL% and %BLOGTITLE% tokens will not parse when using the test comments button.  This means that when you click test comments, hit spin again and view your comments, the first 3 tokens above will display what they are supposed to, but the blogurl and blogtitle will just display the token itself in the comment.

This is how it is supposed to be, because scrapebox does not know what blog it is posting to, so it can't render those.  When the actual comment is submitted it will then parse those two tokens and submit the correct content in the comment.

Trackbacks Note:  No tokens work with the trackback submitter.

Updated 3-16-2017

 

How does Fast Poster Connection Balancing work?

It splits your list into 500 URL batches internally.  The connections go down to zero momentarily after each 500 URL "burst" before the next 500 are posted to.  This gives Windows and the network a short break to process outstanding messages and free everything up.

It will slow down the comment run slightly, but can provide more stability on some people's systems.
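
The batching idea can be sketched in a few lines of Python.  This only illustrates the "500 at a time, then pause" pattern; the names batches and run_in_bursts, the handle_batch callback, and the pause length are all hypothetical and say nothing about how Scrapebox implements it internally.

```python
import time


def batches(urls, size=500):
    """Yield consecutive slices of at most `size` urls."""
    for start in range(0, len(urls), size):
        yield urls[start:start + size]


def run_in_bursts(urls, handle_batch, size=500, pause_seconds=0.0):
    """Hand each batch to handle_batch, pausing between bursts.

    handle_batch is a hypothetical callback that is expected to return only
    once every connection in the batch has finished.
    """
    done = 0
    for batch in batches(urls, size):
        handle_batch(batch)
        done += len(batch)
        time.sleep(pause_seconds)  # the short break between bursts
    return done
```

A list of 1200 urls would be handled as three bursts of 500, 500 and 200.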

What determines the success rate when posting?

When you manually post to a blog, sometimes it will kick back and say something like "Your comment was successful".  When scrapebox receives this message it reports the post as successful.

If scrapebox does not receive a message to this effect, it reports the post as failed.  If you have done much manual blog commenting, you know that this doesn't necessarily mean it failed.  Sometimes the blog simply accepts your comment and redirects you back to the post without any notification, or does something else.  The comment might be successful in this case, but because the blog didn't kick back a notification scrapebox can't confirm that it was successful, so it reports it as failed.

What is a Trackback?

In short a trackback is when site A links to site B and in turn site B links back to site A.  (more or less).  You do not have to link to the sites from your site for this to work, scrapebox sends trackbacks automatically and fakes it like you are linking to their site.  So that they will link to your site without you having to link to theirs.

For a technical definition you can read more on trackbacks here.

Can scrapebox post to https urls?

No, Scrapebox does not have the SSL libraries needed to post to https urls.

Does scrapebox poster support nested spintax?

Yes, the scrapebox commenter supports nested spintax.  For example:

{{Hi|Hello} {how are you|how have you been}|{Hope you are well|Hope you have been well}}
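
To see how a nested line like this resolves, here is a minimal Python sketch of a spintax resolver.  It is one common way to do it (resolve the innermost groups first, then work outwards) and is not taken from Scrapebox itself; the function name spin is an assumption.

```python
import random
import re

# Matches a {a|b|c} group that contains no further braces, i.e. an innermost group.
INNERMOST = re.compile(r"\{([^{}]*)\}")


def spin(text, rng=random):
    """Resolve spintax by repeatedly replacing the innermost {a|b|c} groups."""
    while True:
        text, count = INNERMOST.subn(
            lambda m: rng.choice(m.group(1).split("|")), text
        )
        if count == 0:
            return text
```

Run on the example above, the first pass picks one option from each inner group, the second pass picks between the greeting half and the "hope you are well" half, so the result is one of six possible sentences.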

Can I use spin text in emails for posting?

Yes you can use spin text in emails in the poster, such as fast poster in the bottom right hand quadrant of the main scrapebox window.

For Example:

Matt{1|2|3}@{hotmail|gmail|yahoo}.com

I whipped up a template for you to use as well.  It uses a token, notice the %kw%. The concept here is that you go to the names tool and generate 100 names (or however many) and then paste them in the keyword box.

Important Note: Only use first names or last names NOT both, else there will be a space in between the first and last name, and clearly you can not put a space in an email address.

Then save off my email template below, into a text file.  Then use the M merge button to merge it in and get endless variations.   Then save those spun variations and use them for your emails.

The following should all be on one line.  I also created the text file, so you can just download it here (right click and choose "save as").

%kw%{{1|2|3|4|5|6|7|8|9|0}|{1|2|3|4|5|6|7|8|9|0}{1|2|3|4|5|6|7|8|9|0}|{1|2|3|4|5|6|7|8|9|0}{1|2|3|4|5|6|7|8|9|0}{1|2|3|4|5|6|7|8|9|0}|{1|2|3|4|5|6|7|8|9|0}{1|2|3|4|5|6|7|8|9|0}{1|2|3|4|5|6|7|8|9|0}{1|2|3|4|5|6|7|8|9|0}}@{hotmail|gmail|yahoo|googlemail|ymail|aol|mac|zoho|outlook|live|mail|inbox|facebookmail}.com
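
As a rough picture of what the merge step produces, the Python sketch below swaps %kw% for each name and skips names that contain a space.  It assumes Merge does a plain one-line-per-keyword substitution; the function names usable_names and merge are made up for this illustration.

```python
def usable_names(names):
    """Keep only single-word names, since an email address cannot contain a space."""
    cleaned = (name.strip() for name in names)
    return [name for name in cleaned if name and " " not in name]


def merge(template, keywords):
    """Return one line per keyword with %kw% swapped for that keyword."""
    return [template.replace("%kw%", kw) for kw in usable_names(keywords)]
```

For instance, merging a shortened template "%kw%{1|2}@{hotmail|gmail}.com" with the names matt and anna gives two lines, one starting with matt and one with anna, each still carrying the spintax for the poster to resolve later.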

If you need further explanation on the Merge feature, you can get more info here:

Using Tokens with the Merge feature in Scrapebox

When training the ScrapeBox Learning Mode Poster to new forms, what variables can be used?

For training platforms it's best to start here:
http://www.scrapebox.com/training-new-platforms

These are the variables that can be used when training a new ini manually for the Learning Mode Poster / Custom Poster:

[SetUp]
PageMustContain=If any of the given strings can be found in the pagecode, the page is valid. | is interpreted as OR, * is interpreted as AND
PageMustNotContain=If any of the given strings can be found in the pagecode, the page is invalid. | is interpreted as OR, * is interpreted as AND
Success=If any of the given strings can be found in the resultpage after post, the submission was a success. | is interpreted as OR, * is interpreted as AND
Failed=If any of the given strings can be found in the resultpage after post, the submission failed. | is interpreted as OR, * is interpreted as AND
LoadUrl=Locate the given url and load the target page. Will skip RemovefromUrl, RemoveFromUrlAfter, and ModifyUrl
LoadUrlFromAnchor=Locate the given anchor, grab the url and load the target page. Will skip RemovefromUrl, RemoveFromUrlAfter, and ModifyUrl
RemoveFromUrl=Remove given strings from the baseurl. Multiple strings are separated with |
RemoveFromUrlAfter=Remove everything from the position of given strings in the baseurl. Multiple strings are separated with |
ModifyUrl=Add something to the baseurl. Variables %host% and %path% can be used to rebuild the baseurl.
DeleteCookies=list of cookie names to delete

[Step]
DoStepIf=Process this step only when any of the given strings can be found in the page code. | is interpreted as OR, * is interpreted as AND. If not set, the step will be processed always.
FormMustContain=The form is valid when any of the given strings can be found in the form. | is interpreted as OR, * is interpreted as AND
FormMustNotContain=If the form contains any of the given strings, the form is invalid. | is interpreted as OR, * is interpreted as AND
PostUrl=A | separated list of url parts used to grab the post url. It looks between <form and >
AddToPostUrl=A value added to post url. Masks (%...%) can be used.
DelayPost=Delay post by the given number of seconds. The variable %rndnum-x-y% can be used too.
DelayPostIf=Only delay the post when any of the listed strings can be found. Multiple strings are separated with |
AddToPostDataIfInpage=Will add all AddToPostData= fields when any of the | separated strings is found in the pagecode.
AddToPostData=fieldname=variable will be added to the postdata when the AddToPostDataIfInPage condition is true. When no AddToPostDataIfInPage is set, AddToPostData will always be added.
EncodeFieldNames=1 will url encode fieldnames.

Variables:
All ini settings using variables allow spintax, for example thename={%rnd-name%|%rnd-email%} is valid. Values assigned to variables also allow spintax.
%host% Represents the host name of the target url
%path% Represents the path of the target url
%rnd-name% Returns a random name from the file ~cpn.txt. Spintax allowed.
%rnd-email% Returns a random email from the file ~cpe.txt Spintax allowed.
%rnd-website% Returns a random website from the file ~cpw.txt Spintax allowed.
%rnd-comment% Returns a random comment from the file ~cpc.txt Spintax allowed.
%rnd-option% Returns a random option. Values are grabbed from the <select>/<option> tags of the form
%rnd-location% Spintax allowed.
%rndnum-x-y% Returns a random number between x and y.
%ignore% Just use the original value represented in the form.
%user-domain% Extract the domain of the user's website generated previously by %rnd-website%
%user-name% Previously by %rnd-name% generated username
%user-email% Previously by %rnd-email% generated email
%user-comment% Previously by %rnd-comment% generated comment
%user-location% Previously by %rnd-location% generated location
%user-website% Previously by %rnd-website% generated website
%wphashcash% Result of WPHashCash processing (internal code)
%captcha% Image captcha result
%question% text captcha result
%serverstatus-200% Represents server status code 200
%serverstatus-302% Represents server status code 302
%header-xxxx% Checking the post header for the presence of xxxx in it.
%unixtimestamp% Returns the current unix timestamp
%unixtimestampms% Returns the current unix timestamp in milliseconds
%xxxxxx% Executing a section with the name xxxxx

Sections

[xxxxx]
Action=extract (extract a text between before and after)
Before= The text before the wanted part
After= The text after the wanted part
Default= If no part can be extracted, this is what will be used by default

[xxxxx]
Action=getfieldvalue (return the value of a field)
Fieldname=The name of the field
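
The extract action is essentially "grab the text between two markers, or use the default".  A minimal Python sketch of that idea follows; the function name extract and the choice to take the first occurrence of each marker are assumptions, not documented behavior.

```python
def extract(page, before, after, default=""):
    """Return the text between the first `before` marker and the next `after` marker."""
    start = page.find(before)
    if start == -1:
        return default
    start += len(before)
    end = page.find(after, start)
    if end == -1:
        return default
    return page[start:end]
```

For example, with Before set to value=" and After set to ", the page code <input name="token" value="abc123"> would give abc123, and a page without the Before marker would give the Default.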

Other
processwpspamfree=1 use this to force the check for Wp-SpamFree

Failed MASK = Matches a Failed= ini response.

~~~~

When training form fields you're looking for the name=X field.

So for this sample form from the Icybook platform, some of the form code looks like

  <th class="newleft">*Name:</th>
      <td class="newright"><input type="text" size="30" name="autor" maxlength="30" value="" /></td>
    </tr>
    <tr>
      <th class="newleft">Email:</th>
      <td class="newright"><input type="text" size="30" name="email" maxlength="50" value="" /></td>
    </tr>
    <tr>
      <th class="newleft">Homepage:</th>
      <td class="newright"><input type="text" size="30" name="homepage" maxlength="50" value="http://" /></td>
    </tr>

So name="autor" is associated with the actual "Name" field where you put your name, so it would look like

autor=%rnd-name%

The above will look for the form field, which is on the left, and populate that form field with what is on the right.  In this case it will get a random name from your names.txt file (which is the topmost of the 5 boxes that you load in Scrapebox when posting).

Then for the email, which is the next box down that you load in, it would be

email=%rnd-email%

And for your website link, from websites.txt box

homepage=%rnd-website%

So under the [Step] section you would have

autor=%rnd-name%
email=%rnd-email%
homepage=%rnd-website%

The fields on the left of the equals sign are the X in "name=X" of the form data and the variables (With percent signs) on the right hand side of the equals sign pull data from the files you load into Scrapebox.
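
If a form is long, it can be easier to list the name=X values with a small script than to read the page source by eye.  The Python sketch below uses the standard library html.parser module to do that.  It is just a helper idea and not part of Scrapebox; the class and function names are made up.

```python
from html.parser import HTMLParser


class FieldNames(HTMLParser):
    """Collect the name attribute of every input, textarea and select tag."""

    def __init__(self):
        super().__init__()
        self.names = []

    def handle_starttag(self, tag, attrs):
        # Self-closing tags such as <input ... /> also end up here by default.
        if tag in ("input", "textarea", "select"):
            name = dict(attrs).get("name")
            if name:
                self.names.append(name)


def field_names(form_html):
    """Return the field names found in form_html, in page order."""
    parser = FieldNames()
    parser.feed(form_html)
    parser.close()
    return parser.names
```

Fed the Icybook form code above, this returns autor, email and homepage, which are the names that go on the left of the equals sign.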

~~~~

Captcha data is stored in the captcha.def and textcaptcha.def files.

So if you use %captcha% then scrapebox will look in captcha.def for the before and after markers to get the image to send to the captcha solving service/program.

The same goes for %question%: scrapebox will look for the markers in the textcaptcha.def file.