The first question is relatively easy. Valence Bond slightly predates MO theory. When Linus Pauling proposed orbital hybridization as an outshoot of VB theory, MO theory was still in its infancy and wouldn't be applied to real molecules for several more years. Even beyond that, determining the tetrahedral shape of methane with hybridization/VB is a lot easier than determining it with MO theory, especially without the more recent mathematical tools like group theory and symmetry treatments.
The second question I think is predicated on a false impression that MO theory is somehow superior to VB theory. This isn't really the case. It's true that MO theory is more complex and is far more developed than VB, but this doesn't mean it's always better. VB approaches are still used today in many theoretical calculations of organic molecules. Ironically enough, while I complain about organic chemistry courses doing a disservice to students by glossing over MO theory, I could just as easily complain about physical chemistry courses doing the same thing by glossing over VB theory as a mere historical footnote.
That said, I do think MO theory has a far wider range of applications than VB theory. VB theory uses only atomic orbitals with valence (bonding) electrons to form localized chemical bonds, with most electron density located between neighboring atoms. This may be appropriate in most cases, and indeed theoretical treatments using VB and MO theories very frequently give, within a margin of error, similar results for chemical structure, reactivity, and electron density. VB theory is, I believe, computationally simpler (because only valence orbitals are involved), and so it is often relied upon for large or complex molecules, or where a qualitative result is sufficient. Just as with VB theory, an MO calculation will also tend to show that most electron density is localized between adjacently bonded atoms - although because of its better stage of development, results from modern MO calculations are usually more detailed and offer a better quantitative match to experimental data in many cases. This is especially true in molecules in which delocalized electrons are clearly important, because MO theory allows for the possibility that (indeed, is based on the assumption that) electrons are located in orbitals that span the entire molecule. Understanding spectroscopic behavior is an additional exclusive strength of MO theory, because of its intimate relationship with molecular symmetry and its ability to actually predict electronic transitions.
So, were I to sum it up generally in a sentence: VB theory is a good qualitative approach that does a very good job of describing molecular structure and reactivity in most cases, whereas MO theory does a better job when quantitative predictions of molecular characteristics are important. MO theory also succeeds in some specific situations that VB is simply not well suited, such as open shell or parametric molecules, geometries when lone pairs are involved, and so on.